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COMPOUNDS 

This invention relates to novel polypeptides and their derivatives which act as 
inhibitors or regulators of complement activation and are of use in the therapy of 
diseases involving complement activation such as various inflammatory and immune 
disorders. 

Constituting about 10% of the globulins in normal serum, the complement system 
is composed of many different proteins that are important in the immune system's 
response to foreign antigens. The complement system becomes activated when its 
primary components are cleaved and the products alone or with other proteins, activate 
additional complement proteins resulting in a proteolytic cascade. Activation of the 
complement system leads to a variety of responses including increased vascular 
permeability, chemotaxis of phagocytic cells, activation of inflammatory cells, 
opsonization of foreign particles, direct killing of cells and tissue damage. Activation of 
the complement system may be triggered by antigen-antibody complexes (the classical 
pathway) or, for example, by lipopolysaccharides present in cell walls of pathogenic 
bacteria (the alternative pathway). 

Complement activation (CA) is known to occur in a wide variety of acute 
inflammatory processes particularly those associated with ischaemia and reperfusion 
injury (Rossen et al., 1985 Circ. Res., 57, 119,; Morgan B.P., 1990 The biological effects 
of complement activation. In 'Complement, Clinical Aspects and Relevance to Disease'. 
Academic Press. London.) 

It is also generally accepted that at least some of the components of the classical 
complement cascade can be detected by immunohistochemical methods in close 
association with senile plaques in the brains of sufferers from Alzheimer's disease 
(Eikelenboom et al., 1994, Neuroscience, 59, 561-568) and that complement activation 
plays a role in the inflammatory component of this condition.. 

Complement receptor type 1 (CR1) has been shown to be present on the 
membranes of erythrocytes, monocytes/macrophages, granulocytes, B cells, some T cells, 
splenic follicular dendritic cells, and glomerular podocytes. CR1 binds to the 
complement components C3b and C4b and has also been referred to as the C3b/C4b 
receptor. The structural organisation and primary sequence of one allotype of CR1 is 
known (Klickstein etal. t 1987, J. Exp. Med. 165:1095-1112, Klickstein et aL, 1988, J. 
Exp. Med. 168:1699-1717; Hourcade et aL, 1988, J. Exp. Med. 168:1255-1270, WO 
89/09220, WO 91/05047). It is composed of 30 short consensus repeats (SCRs) that each 
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contain around 60-70 amino acids. In each SCR, around 29 of the average 65 amino 
acids are conserved. Each SCR has been proposed to form a three dimensional triple 
loop structure through disulfide linkages with the third and first and the fourth and 
second half-cystines in disulphide bonds. CR1 is further arranged as 4 long homologous 
repeats LHRs) of 7 SCRs each. Following a leader sequence, the CR1 molecule consists 
of the N-termmal LHR-A, the next two repeats, LHR-B and LHR-C, and the most C- 
terminal LHR-D followed by 2 additional SCRs, a 25 residue putative transmembrane 
region and a 43 residue cytoplasmic tail. 

Based on the mature CR1 molecule having a predicted N-terminal glutamine 
residue, hereinafter designated as residue 1, the first four SCR domains of LHR-A are 
defined herein as consisting of residues 2-58, 63-120, 125-191 and 197-252, respectively 
of mature CR1. F y * 

Hourcade et a/.,1988. J. Exp. Med. 168:1255-1270 observed an alternative 
polyadenylation site in the human CR1 transcriptional unit that was predicted to produce 

Lr 8 r Ts d c f r f °™ L t mRNA encoded by this truncated sequence C « ^ 

first 8.5 SCRs of CR1, and encodes a protein of about 80 kDa which was proposed to 
include the C4b binding domain. When a cDNA corresponding to this truncated 
sequence was transfected into COS cells and expressed, it demonstrated the expected C4b 
binding activity but did not bind to C3b (Krych et a/., 1989, FASEB J. 3:A368- Krych et 
al. Proc. Nat. Acad. Sci. 1991, 88, 4353-7). Krych et al., also observed a mRNA similar 
to the predicted one in several human cell lines and postulated that such a truncated 
soluble form of CR1 with C4b binding activity may be synthesised in humans 

In addition, Makrides et al. (1992, J. Biol. Chem. 2£7 (34) 24754-61) have 
expressed SCR 1 + 2 and 1 + ? + ^ + 4 nf t hd a _ 

nuo i * ~ 5 + 4 of LHR " A as membrane-attached proteins in 

CHO cells. 

Several soluble fragments of CR1 have also been generated via recombinant DNA 
procedures by eliminating the transmembrane region from the DNAs being expressed 
^0 89709220, WO 91/05047). The so.uble CR1 fragments were functi^al^ 
bound C3b and/or C4b and demonstrated Factor I cofactor activity depending upon the 
regions they contained. Such constructs inhibited /„ vitro complement-related functions 
such as neutrophil oxidative burst, complement mediated hemolysis, and C3a and C5a 
production. A particular soluble construct, sCRl/pBSCRlc, also demonstrated in vivo 
activity in a reversed passive Arthus reaction (WO 89/09220, WO 91/05047- Yeh et al 
1991, J. Immunol. 146:250), suppressed post-ischemic myocardial inflammation and " 
necrosis (WO 89/09220, WO 91/05047; Weisman et al., Science, 1990 249 146 1511- 
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Dupe, R. et al. Thrombosis & Haemostasis (1991) 65(6) 695.) and extended survival 
rates following transplantation (Pruitt & Bollinger, 1991, J^Surg. Res 50:350; Pruitt et 
al } 1991 Transplantation 52; 868). Furthermore, co-formulation of sCRl/pBSCRlc 
with p-anisoylated human plasminogen-streptokinase-activator complex (APS AC) 
resulted in similar anti-haemolytic activity as sCRl alone, indicating that the combination 
of the complement inhibitor sCRl with a thrombolytic agent was feasible (WO 
91/05047). 

In a model of antibody-mediated demyelinating experimental allergic 
encephalomyelitis (ADEAE), systemic inhibition of CA using sCRl over 6 days, 
produced improvements in clinical score and blocked CNS inflammation, demyelination 
and deposition of complement components (Piddlesden et aL, 1994, J. Immunol. 152, 
5477). ADEAE can be regarded as a model of acute relapse in multiple sclerosis (MS) 
and these striking results suggested possible applications for sCRl in MS therapy despite 
the high molecular weight (245 kilodaltons) of this agent. 

In a rat model of traumatic brain injury, complement inhibitor sCRl (also known 
as TP10 or BRL55730) was shown to reduce myeloperoxidase activity (an indicator of 
neutrophil accumulation) following traumatic injury (Kaczorowska et al, 1995, J. 
Cerebral Blood Flow and Metabolism, 15, 860-864). This is suggested as demonstrating 
that complement activation is involved in the local inflammatory response. 

Soluble polypeptides corresponding to part of CR1 having functional complement 
inhibitory, including anti-haemolytic activity, have been described in WO94/00571 
comprising, in sequence, one to four short consensus repeats (SCR) selected from SCR 1, 
2, 3 and 4 of long homologous repeat A (LHR-A) as the only structurally and 
functionally intact SCR domains of CR1 and including at least SCR3. 

Pseudogenes are usually defined as DNA sequences which possess a high degree 
of homology to genes with identified function but which are not expressed. The origins 
of the lack of transcription and translation vary but are commonly the presence of 
accumulated mutations which inactivate transcriptional initiation sites, disrupt RNA 
splicing or introduce frame-shift mutations and premature termination codons. 
Pseudogenes are sometimes regarded as genetic relics which have been isolated within 
the genome through a primary loss of expressability and which have subsequently 
mutated randomly in situ to highly aberrant forms. There is a frequent presumption that 
pseudogene sequences, if expressable at all, will not be functionally active because of an 
accumulation of deleterious in-frame mutations. However, studies of immune system 
genetics suggest that pseudogenes may act as a source of diversity in somatic mutation 
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processes and that non-expressed sequences may recombine with normally expressed 
genes to create functional variants with a conserved framework. This phenomenon has 
been documented m immunoglobulin VL and VH genes and elsewhere 
(W.T.McCormack et al, Genes Dev. 4, 548-58, 1990). 

The creation of pseudogenes through reverse transcription followed by DNA 
Ration is also known. In such cases, the integrated sequences (which can in principle 
originate from organisms other than the host) lack introns and may be sited in 
chromosomal locations distant from the expressed gene to which they are homologous 
because integration into the genome can occur at random site, Such genes are known as 
processed pseudogenes. The presence of pseudogenes in chromosomal clusters with 
homologous expressed genes argues against them being processed pseudogenes because 
of the improbahdity of a random integration process giving rise to close physical 
clustering in a large genome. 

p The existence of a gene homologous to that for complement receptor type 1 
CR1) was first reported by Hourcade et al. (J.Biol.Chem, 275, 974-80, 1990) who 
found it associated with a gene cluster on chromosome lq 32 termed the Regulators of 
Complement Activation (RCA) cluster. The latter comprises the CR1 gene itself and 
those encoding decay accelerating factor (DAF), membrane cofactor protein (MCP) 
Factor H, complement receptor type 2 (CR2) and C4 binding protein. This 'CRl-like' 
gene was predicted to encode a protein containing seven SCR regions corresponding (by 
closest omo ogy) to SCRs 1-6 and 9 of LHR-A (1-6) or LHR-B (9) of CR1 itself. The 
overall homology with the above regions of CR1 at the predicted amino acid level is 91% 
and the sequence divergence is greatest in the first three SCRs. 

CR1 " LIKE GENE (CrlPSe) WITH ™ HUMAN CR1 SEQUENCE 
(Lrl.Pe, LHRA region). 

(NB. CR1 numbering includes the signal sequence) 

80.7% identity in 446 aa overlap 
Crlpse 

QCNVPEWLPFARPTNLTDDFEFPIGTYLNY 
M : I I M M M I I I M | : | | | | f | | | , j , 



Crl . Pe VGPPAPGLPF' 

40 50 60 7 0 



CCGGSLLAWVLLALPVAWGQCNAPEWLPFARPTNLTDEFEFPIGTYLNY 
20 30 
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40 50 60 70 80 90 

Crlpse ECRPGYSGRPFSIICLKNSVWTSAKDKCKRKSCRNPPDPVNGMAHVIKDIQFRSQIKYSC 
I I I I I I I I I I I I M I M I I I [ I : I I I : I : I I I I I I I I I I I I I I : M I I : I I I I I I I I I I 
Crl . Pe ECRPGYSGRPFSIICLKNSVWTGAKDRCRRKSCRNPPDPVNGMVHVIKGIQFGSQIKYSC 
80 90 100 110 120 130 

100 110 120 130 140 150 

Crlpse PKGYRLIGSSSATCI ISGNTVIWDNKTPVCDRI ICGLPPTI ANGDFTSISREYFHYGSVV 
: I I I I I I I I i I I I I I I II : I I I I I I : I I : I I I I I I I I I I I : I I I I : I : : II (Mill! 
Crl . Pe TKGYRLIGSSSATCI ISGDTVIWDNETPICDRIPCGLPPTITNGDFI STNRENFHYGSW 
140 150 160 170 180 190 

160 170 180 190 200 210 

Crlpse TYKCNLGGRGKKv r FELVGEPSIYCTSKDDQVGIWSGPAPQCIIPNKCTPPNVENGILVSD 
11:11 II I : I I I I I I I I I I I I I I I : M M i I I I M I I I I I I I I I I I II I I I I I I II I I 
Crl . Pe TYRCNPGSGGRKVFELVGEPS I YCTSNDDQVGIWSGPAPQCI IPNKCTPPNVENGILVSD 
200 210 220 230 240 250 

220 230 240 250 260 270 

Crlpse NRSLFSLNEWEFRCQPGFGMKGPSHVKCQALNKWEPELPSCSRVCQPPPDVLHAERTQR 
M I I I M I I I I I II I I M I I I II : : I I I I I I I 1 I I II I I I M I I I I I I I J I I I I I I I I I 
Crl . Pe NRSLFSLNEWEFRCQPGFVMKGPRRVKCQALNKWEPELPSCSRVCQPPPDVLHAERTQR 
260 270 280 290 300 310 

280 290 300 310 320 330 

Crlpse DKDNFSPGQEVF YSCEPGYDLRGSTYLHCTPQGDWSPAAPRCEVKSCDDFLGQLPNGHVL 
I 11 I I II I I I I I I I I I I I I I I I I : : : : I I I M I I M I I I II II I I I I I : I I ! M : M 
Crl . Pe DKDNFSPGQEVFYSCEPGYDLRGAASMRCTPQGDWSPAAPTCEVKSCDDFMGQLLNGRVL 
320 330 340 350 360 370 

340 350 360 370 380 390 

Crlpse FPLNLQLGAKVDFVCDEGFQLKGSSASYCVLAGMESLWNSSVPVCERESCKTPPVPVNGM 
I I : II I I I I I II I I I II I I I I II I I I I I I I I I I I I M I I I I II II I : I : I I I II : 
Crl . Pe FPVNLQLGAKVDFVCDEGFQLKGSSASYCVLAGMESLWNSSVPVCEQI FCPSPPVI PNGR 
380 390 400 410 420 430 
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400 410 420 430 

Crlpse — VHVITDIHVGSRINYSC TTGHRLIGHSSAECIL-SGNTAHWSMKPPICQ 

: : : : : I : : I I : I : I : I I I : I : | : : : | | : | | 

Crl -Pe HTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNGVWSSPAPRCG 

440 4 50 460 470 480 490 

The CRl-like gene also encodes a signal peptide but not a transmembrane or 
cytoplasmic region and contains an intron-exon structure similar to the CR1 gene. There 
is currently no evidence that the CRl-like gene is expressed and neither mRNA 
transcripts nor a soluble protein have been isolated. The origin of the CRl-like sequence 
may lie in a gene duplication event in an ancestral CR1 gene which was followed by 
divergence and transcriptional inactivation of the CRl-like gene. It appears probable that 
the CRl-like gene is currently a pseudogene although not of. the processed type. 

It has now been found that replacement of codons in DNA encoding the first three 
SCRs of LHR-A of CR1 with others encoding the predicted aminoacids in the CRl-like 
sequence can give rise to chimeric genes which can be expressed to give active 
complement inhibitors with functional complement inhibitory, including anti-haemolytic, 
activity. 

According to the present invention there is provided a soluble polypeptide 
comprising, in sequence, one to four short consensus repeats (SCR) selected from SCR 1, 
2, 3 and 4 of long homologous repeat A (LHR-A) as the only structurally and 
functionally intact SCR domains of CR1 and including at least SCR3, in which one or 
more of the native amino acids are substituted with the following: 
Val 4, Asp 19, Ser 53, Lys 57, Ala 74, Asp 79, Arg 84, Pro 91, Asn 109, Lys 1 16, Val 
119, Ala 132, Thr 137, He 139, Ser 140, Tyr 143, His 153, Leu 156, Arg 159, Lys 161, 
Lys 177, Gly 230, Ser 235, His 236. 

(Numbering is from glutamine as residue 1 of mature CRL The amino-acid indicated is 
that which replaces the CR1 residue at the position specified.) 

In preferred aspects, the polypeptide comprises, in sequence, SCR 1, 2, 3 and 4 
of LHR-A or SCR 1, 2 and 3 of LHR-A as the only structurally and functionally intact 
SCR domains of CR1 with the modification(s) described above. 

It is to be understood that additional variations in the amino acid sequence of the 
polypeptide of the invention by way of addition, deletion or conservative substitution of 
residues, including allelic variations, in which the biological activity of the polypeptide is 
retained, are encompassed by the invention. Conservative substitution is understood to 
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mean the retention of the charge and/or size characteristics of the amino acid side chain, 
for example arginine replaced by lysine or glutamine . _ 

In one aspect, the polypeptide of the invention may be represented symbolically 
as follows: 

NH 2 -V 1 -SCRl-W 1 -SCR2-X 1 -SCR3-Y 1 -OH (I) 
in which SCR1 represents residues 2-58 of mature CR1, SCR2 represents residues 63- 
120 of mature CR1, SCR3 represents residues 125-191 of mature CR1, and containing at 
least one of the substitutions as aforesaid and V 1 , W 1 , X 1 and Y* represent bonds or 
short linking sequences of amino acids, preferably 1 to 5 residues in length and which are 
preferably derived from native interdomain sequences in CRL 

The native interdomain sequences in CR1 may also be substituted with the 
corresponding predicted aniinoacids in the CR1 -like sequence, namely Lys59 and/or 
He 124. (Numbering is from glutamine as residue 1 of mature CRL The amino-acid 
indicated is that which replaces the CR1 residue at the position specified.) 

In a preferred embodiment , the SCR3 domain of formula (I) is substituted with 
all- ten residues found in the corresponding pseudogene sequence, namely (in single letter 
code): 

A132, T137, 1139, S140, Y143, H153, L156, R159, K161, K177 (Sequence Group 1) 
and the remaining domains have the sequence of mature CRL 

In a further preferred embodiment of formula (I), W*, X^ and Y* represent 
residues 59-62, 121-124 and 192-196, respectively, of mature CR1, optionally substituted 
as aforesaid and V* represents residue 1 of mature CR1 optionally linked via its N- 
terminus to methionine. 

In another aspect the polypeptide of the invention may be represented 
symbolically as follows: 

NH 2 -V 2 -SCRl-W 2 -SCR2-X 2 -SCR3-Y 2 -SCR4-Z 2 OH (II) 
in which SCR1, SCR2 and SCR3 are as hereinbefore defined, SCR4 represents residues 
197-252 of mature CR1 and containing at least one of the substitutions as aforesaid, and 
V 2 , W 2 , X 2 , Y 2 and Z 2 represent bonds or short linking sequences of amino acids, 
preferably 1 to 5 residues in length and which are preferably derived from native 
interdomain sequences in CR1, optionally substituted as aforesaid. 

In a preferred embodiment of formula (II), the SCR3 region is substituted with 
the aforementioned Sequence Group 1 residues and the remaining domainshave the 
sequence of mature CRL 
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In further preferred embodiments of formula (II), W2 X \ Y 2 and Z2 represent 
residues 59-62, 121-124, 192-196, and residues 253 respectively, of mature CRI 
option^ su5 ^ as aforesaid> and y2 represpms resid . e i ^ mafure ^ 

linked via its N-terminus to methionine. * 

In one particular embodiment of formula (II) arginine 235 is replaced by 
histidine. J 

In the preferred embodiment of formula (II), residue 235 is arginine 
In one further aspect, the polypeptide of the invention may be represented 
symbolically as follows: 

NH 2 -X3-SCR3-Y3-OH (m) 
in which SCR3 is as hereinbefore defined, containing at least one of the substitutions as 
aforesaid, and in a preferred embodiment, all those of Sequence Group 1, and X 3 and Y 3 
represent bonds or short linking sequences of amino acids, preferably 1 to 5 residues in 
length and which are preferably derived from native interdomain sequences in CR1 
optionally substituted as aforesaid. 

In 3 further I™*"* embodiment of formula (III) X 3 repre sents amino acids 
122-124 of mature CR1, optionally substituted as aforesaid, optionally linked to 
methionine at its N-terminus and Y 4 represents amino acids 192-196 of mature CR1 

In another further aspect, the polypeptide of the invention may be represented 
symbolically as follows: 

NH 2 -X4-SCR3-Y4-SCR4-Z4-OH (IV ) 
in which SCR3 and SCR4 are as hereinbefore defined containing at least one of the 
substitutions as aforesaid and X 4 Y 4 and Z 4 represem bonds Qr ^ 
of ammo acids, preferably 1 to 5 residues in length and which are preferably derived 
from native interdomain sequences in CR1, optionally substituted as aforesaid 

In a preferred embodiment of formula (IV), the SCR3 region is substituted with 
the residues of Sequence Group 1 and the remaining domains have the sequence of 
mature CRI. 

In a further preferred embodiment of formula (IV) X 4 represents amino acids 
1-2-124 of mature CRI, optionally substituted as aforesaid, optionally linked to 
methionine at its N-terminus and Y 4 and Z 4 represent amino acids 192-196 and 253 
respectively of mature CRI. 
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The soluble polypeptides of the invention lack the membrane binding 
capability of the full length CR1 proteins which properties jnay be advantageous to the 
therapeutic activity. 

The main classes of interaction of proteins with membranes can be summarised as 
follows: 

1. Direct and specific interactions with phospholipid head groups or with 
other hydrophilic regions of complex lipids or indirectly with proteins already inserted in 
the membrane. The latter may include all the types of intrinsic membrane protein noted 
below and such interactions are usually with extracellular domains or sequence loops of 
the membrane proteins; 

2. Through anchoring by a single hydrophobic transmembrane helical region 
near the teirninus of the protein. These regions commonly present a hydrophobic face 
around the entire circumference of the helix cylinder and transfer of this structure to the 
hydrophilic environment of bulk water is energetically unfavourable. 

3. Further anchoring is often provided by a short sequence of generally 
cationic aminoacids at the cytoplasmic side of the membrane, C-terminal to the 
transmembrane helix. The membrane-binding properties of CR1 are provided by features 
2 and 3. 

4. Through the use of multiple (normally 2-12 and commonly 4,7 and 10) 
transmembrane regions which are usually predicted to be helical or near-helical. 
Although these regions are normally hydrophobic overall, they frequently show some 
amphipathic behaviour - an outer hydrophobic face and an inner more hydrophilic one 
being identifiable within a helix bundle located in the lipid bilayer; 

5. Through postranslationally linked phosphatidyl inositol moieties (GPI- 
anchors). These are generated by a specific biosynthetic pathway which recognises and 
removes a specific stretch of C-terminal aminoacids and creates a membrane-associating 
diacyl glycerol unit linked via a hydrophilic carbohydrate spacer to the polypeptide; 

6. In a related process, single fatty acid groups such as myristoyl, palmitoyl 
or prenyl may be attached postranslationally to one or more sites in a protein (usually at 
N- or C-termini). Again, amino acids. (such as the C-terminal CAAX box in Ras proteins) 
may be removed. 

The present invention further provides a soluble derivative of the soluble 
polypeptide of the invention, said derivative comprising two or more heterologous 
membrane binding elements with low membrane affinity covalently associated with the 
polypeptide which elements are capable of interacting independently and with 
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thermodynamic additivity with components of cellular membranes exposed to 
extracellular fluids. 

By "heterologous' is meant that the elements are not found in the native full 
length CR1 protein. 

By 'membrane binding element with low membrane affinity' is meant that the 
element has significant affinity for membranes, that is a dissociation constant greater 
thanlu.M, preferably ljiM-lmM. The elements preferably have a size <5kDa. 

The derivative should incorporate sufficient elements with low affinities for 
membrane components to result in a derivative with a high (preferably 0.01 - lOnM 
dissociation constant) affinity for specific membranes. The elements combine so as to 
create an overall high affinity for the particular target membrane but the combination 
lacks such high affinity for other proteins for which single elements may be (low- 
affinity) ligands. 

The elements should be chosen so as to retain useful solubility in pharmaceutial 
formulation media, preferably >100^g/ml. Preferably at least one element is hydrophilic. 

The further embodiment of the invention thus promotes localisation of the 
polypeptide of the invention at cellular membranes and thereby provide one or more of 
several biologically significant effects with potential therapeutic advantages including: 
Potency: an increase in effective concentration may result from the reduction 
in the diffusional degrees of freedom. 

Pharmacokinetics and dosing frequency: Interaction of the derivatised 
polypeptide with long-lived cell types or serum proteins would be expected to prolong 
the plasma residence time of the polypeptide and produce a depot effect through 
deposition on cell surfaces. 

Specificity: Many clinically important pathological processes are associated 
with specific cell types and tissues (for example the vascular endothelium and the 
recruitment thereto of neutrophils bearing the sialyl Lewis ' antigen to ELAM-1 see 
below).. Hence targeting the modified polypeptide to regions of membrane containing 
pathology-associated membrane markers may improve the therapeutic ratio of the protein 
targeted. 

It will be appreciated that all associations of heterologous amino acid sequences 
with the polypeptide will need to be assessed for potential immunogenic^, particularly 
where the amino acid sequence is not derived from a human protein. The problem can be 
minimised by using sequences as close as possible to known human ones and through 
computation of secondary structure and antigenicity indices. 
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The derivative preferably comprises two to eight, more preferably two to four 
membrane binding elements. _ 

Membrane binding elements are preferably selected from: fatty acid derivatives 
such as fatty acyl groups; basic amino acid sequences; ligands of known integral 
membrane proteins; sequences derived from the complementarity-determining region of 
monoclonal antibodies raised against epitopes of membrane proteins; membrane binding 
sequences identified through screening of random chemical libraries. 

The selection of suitable combination of membrane binding elements will be 
guided by the nature of the target cell membrane or components thereof. 

Suitable fatty acid derivatives include myristoyl (12 methylene units) which is 
insufficiently large or hydrophobic to permit high affinity binding to membranes. 
Studies with ■my-ri-stoy-lated peptides (eg R.M.Peitzsch & S.McLaughlin, Biochemistry, 
32, 10436-10443, 1993)) have shown that they have effective dissociation constants with 
model lipid systems of ~10~ 4 M and around 10 of the 12 methylene groups are buried in 
the lipid bilayer. Thus, aliphatic acyl groups with between about 8 and 18 methylene 
units, preferably 10-14, are suitable membrane binding elements. Other examples of 
suitable fatty acid derivatives include long-chain (8-18, preferably 10-14 methylene) 
aliphatic amines and thiols, steroid and farnesyl derivatives. 

Membrane binding has been found to be associated with limited (single-site) 
modification with fatty acyl groups when combined with a cluster of basic aminoacids in 
the protein sequence which may interact with acidic phospholipid head groups and 
provide the additional energy to target membrane binding. This combination of effects 
has been termed the 'myristoyl-electrostatic switch' (S. McLaughlin and A.Aderem, TIBS, 
20,272-276, 1994; J.F.Hancock et aU Cell, 63, 133-139,1990). Thus, a further example 
of suitable membrane binding elements are basic aminoacid sequences such as those 
found in proteins such as Ras and MARCKS (myristoylated alanine-rich C-kinase 
substrate, PJ. Blackshear, J. Biol. Chem., 268, 1501-1504, 1993) which mediate the 
electrostatic 'switch' through reversible phosphorylation of serine residues within the 
sequence and a concomitant neutralisation of the net positive charge. Such sequences 
include but are not restricted to consecutive sequences of Lysine and Arginine such as 
(Lys)n where n is between 3 and 10, preferably 4 to 7. 

Suitable examples of amino acid sequences comprising basic amino acids include: 

i) DGPKKKKKKSPSKSSG 

ii) GSSKSPSKKKKKKPGD 

iii) SPSNETPKKKKKRFSFKKSG 
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(N-terminus on left) 

Sequences i) to iii) are examples of electrostatic switch sequences. 

Examples of amino acid sequences include RGD-containing peptides such as 
GRGDSP which are ligands for the o, ib (3 3 integrin of human platelet membranes. Further 
examples of such sequences include those known to be involved in interactions between 
membrane proteins such as receptors and the major histocompatibility complex. An 
example of such a membrane protein ligand is the sequence GNEQSFRVDLRTLLRYA 
which has been shown to bind to the major histocompatibility complex class 1 protein 
(MHC-1) with moderate affinity (L.Olsson et al, Proc. Natl .Acad.Sci.USA. 91 9086- 
909, 1994). 

An example of a ligand for an integral membrane protein is the carbohydrate 
ligand Sialyl Lewis' which has been identified as a ligand for the integral membrane 
protein ELAM-1 (M.L.Phillips et al, Science, 250, 1 130-1132, 1990 & G.Walz et al, 
250, 1132-1135,1990). 
Sequences derived from the complementarity-determining regions of monoclonal 
antibodies raised against epitopes within membrane proteins (see, for example, 
J.W.Smith et al, J.Biol.Chem. 270, 30486-30490, 1995) are also suitable membrane 
binding elements, as are binding sequences from random chemical libraries such as those 
generated in a phage display format and selected by biopanning operations in vitro 
(G.F.Smith and J.K.Scott, Methods in Enzymology, 217H, 228-257,1993) or in vivo 
(R.Pasqualini & E.Ruoslahti, Nature, 380, 364-366, 1996). 

Optionally, conditional dissociation from the membrane may be incorporated 
into derivatives of the invention using mechanisms such as pH sensitivity (electrostatic 
switches), regulation through metal ion binding (using endogenous Ca 2 \ Zn 2 * and 
incorporation of ion binding sites in membrane binding elements) and protease cleavage 
(e.g plasminolysis of lysine-rich membrane binding sequences to release and activate 
prourokinase) 

Preferred derivatives of this invention have the following structure: 
[P]-{L-[W]} n -X 

in which: 

P is the soluble polypeptide, 

each L is independently a flexible linker group, 

each W is independently a peptidic membrane binding element, 

n is an integer of 1 or more and 
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X is a peptidic or non-peptidic membrane-binding entity which may be covalently 
linked to any W. _ 

Peptidic membrane binding elements are preferably located sequentially either at 

the N or C terminus of the soluble polypeptide and are preferably 8 to 20 amino acids 

i. 

long. The amino acid sequences are linked to one another and to the soluble peptide by 
linker groups which are preferably selected from hydrophilic and/or flexible aminoacid 
sequences of 4 to 20 aminoacids; linear hydrophilic synthetic polymers; and chemical 
bridging groups. 

In a further aspect, the invention provides a process for preparing a polypeptide 
according to the invention which process comprises expressing DNA encoding said 
polypeptide in a recombinant host cell and recovering the product. 

In particular, the process may comprise the steps of: 

i) preparing a replicable expression vector capable, in a host cell, of expressing a 
DNA polymer comprising a nucleotide sequence that encodes said polypeptide; 

ii) transforming a host cell with said vector; 

iii) culturing said transformed host cell under conditions permitting expression of 
said DNA polymer to produce said polypeptide; and 

iv) recovering said polypeptide. 

The DNA polymer comprising a nucleotide sequence that encodes the polypeptide 
also forms pan of the invention. 

The process of the invention may be performed by conventional recombinant 
techniques such as described in Sambrook et al, Molecular Cloning : A laboratory 
manual 2nd Edition. Cold Spring Harbor Laboratory Press (1989) and DNA Cloning 
vols I, II and III (D. M. Glover ed., IRL Press Ltd). 

The invention also provides a process for preparing the DNA polymer by the 
condensation of appropriate mono-, di- or oligomeric nucleotide units. 

The preparation may be carried out chemically, enzymatically, or by a 
combination of the two methods, in vitro or in vivo as appropriate. Thus, the DNA 
polymer may be prepared by the enzymatic ligation of appropriate DNA fragments, by 
conventional methods such as those described by D. M. Roberts et al., in Biochemistry 
1985, 24, 5090-5098. 

The DNA fragments may be obtained by digestion of DNA containing the 
required sequences of nucleotides with appropriate restriction enzymes, by chemical 
synthesis, by enzymatic polymerisation, or by a combination of these methods. 
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Digestion with restriction enzymes may be performed in an appropriate buffer at a 
temperature of 20°-70°C, generally in a volume of 50ul or less with 0. 1-lOjig DNA 
Enzymatic polymerisation of DNA may be carried out in vitro using a DNA 
polymerase such as DNA polymerase 1 (Klenow fragment) in an appropriate buffer 
containing the nucleoside triphosphates dATP , dCTP, dGTP and dTTP as required at a 
temperature of 10°-37°C, generally in a volume of 50^1 or less. 

Enzymatic ligation of DNA fragments may be carried out using a DNA ligase 
such as T4 DNA ligase in an appropriate buffer at a temperature of 4°C to 37°C 
generally in a volume of 50|xl or less. 

The chemical synthesis of the DNA polymer or fragments may be carried out by 
conventional phosphotriester, phosphite or phosphoramidite chemistry, using solid phase 
techniques such as those described in 'Chemical and Enzymatic Synthesis of Gene 
Fragments - A Laboratory Manual' (ed. H.G. Gassen and A. Lang), Verlag Chemie 
Weinheim (1982), or in other scientific publications, for example M.J.Gait H W D 
Matthes M. Singh, U.S. Sproat and R.C. Titmas, Nucleic Acids Research, 1982 10 
6243; B.S. Sproat and W. Bannwarth, Tetrahedron Letters, 1983 24 5771- M D ' 
Matteucci and M.H. Caruthers, Tetrahedron Letters, 1980, 21, 719; M.D. Matteucci and 
M.H. Caruthers, Journal of the American Chemical Society, 1981, 103 3185- S P 
Adams et al., Journal of the American Chemical Society, 1983, m 661; N.D. Sinha J 
Biernat, J. McMannus and H. Koester, Nucleic Acids Research, 1984, 12 4539- and ' ' 
H.W.D. Matthes et al., EMBO Journal, 1984, 2, 801. Preferably an automated DNA 
synthesiser (for example, Applied Biosystems 381 A Synthesiser) is employed. 

The DNA polymer is preferably prepared by ligating two or more DNA 
molecules which together comprise a DNA sequence encoding the polypeptide 

The DNA molecules may be obtained by the digestion with suitable restriction 
enzymes of vectors carrying the required coding sequences. 

The precise structure of the DNA molecules and the way in which they are 
obtained depends upon the structure of the desired product. The design of a suitable 
strategy for the construction of the DNA molecule coding for the polypeptide is a routine 
matter for the skilled worker in the art. 

In particular, consideration may be given to the codon usage of the particular host 
cell. The codons may be optimised for high level expression in E. coli usin* the 
principles set out in Devereux et al., (1984) Nucl. Acid Res., 12, 387. 

The expression of the DNA polymer encoding the polypeptide in a recombinant 
host cell may be carried out by means of a replicable expression vector capable in the 
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host cell, of expressing the DNA polymer. The expression vector is novel and also forms 
part of the invention. _ 

The replicable expression vector may be prepared in accordance with the 
invention, by cleaving a vector compatible with the host cell to provide a linear DNA 
segment having an intact replicon, and combining said linear segment with one or more 
DNA molecules which, together with said linear segment, encode the polypeptide, under 
ligating conditions. 

The ligation of the linear segment and more than one DNA molecule may be 
carried out simultaneously or sequentially as desired. 

Thus, the DNA polymer may be preformed or formed during the construction of 
the vector, as desired. The choice of vector will be determined in part by the host cell, 
which may be prokaryotic, such as E. coli, or eukaryotic, such as mouse CI 27, mouse 
myeloma, Chinese hamster ovary, fungi e.g. filamentous fungi or unicellular 'yeast* or an 
insect cell such as Drosophila. The host cell may also be in a transgenic animal. Suitable 
vectors include plasmids, bacteriophages, cosmids and recombinant viruses derived from, 
for example, baculoviruses or vaccinia. 

The DNA polymer may be assembled into vectors designed for isolation of stable 
transformed mammalian cell lines expressing the fragment e.g. bovine papillomavirus 
vectors in mouse C127 cells, or amplified vectors in Chinese hamster ovary cells (DNA 
Cloning Vol. II D.M. Glover ed. IRL Press 1985; Kaufman, RJ. et aL. Molecular and 
Cellular Biology 5, 1750-1759, 1985; Pavlakis G.N. and Hamer, D.H. Proceedings of 
the National Academy of Sciences (USA) 80, 397-401, 1983; Goeddel, D.V. et 
<2/.,European Patent Application No. 0093619, 1983). 

The preparation of the replicable expression vector may be carried out 
conventionally with appropriate enzymes for restriction, polymerisation and ligation of 
the DNA, by procedures described in, for example, Sambrook et a/.,cited above. 
Polymerisation and ligation may be performed as described above for the preparation of 
the DNA polymer. Digestion with restriction enzymes may be performed in an 
appropriate buffer at a temperature of 20°-70°C, generally in a volume of 50jal or less 
with 0.1-10|Lig DNA. 

The recombinant host cell is prepared, in accordance with the invention, by 
transforming a host cell with a replicable expression vector of the invention under 
transforming conditions. Suitable transforming conditions are conventional and are 
described in, for example, Sambrook et al. } cited above, or "DNA Cloning" Vol. II, D.M. 
Glover ed., IRL Press Ltd, 1985. 
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The choice of transforming conditions is determined by the host cell. Thus, a 
bacterial host such as E.coli, may be treated with a solution of CaCl 2 (Cohen et a/.,Proc. 
Nat. Acad. Sci., 1973, 69, 2110) or with a solution comprising a mixture of RbCl, 
MnCl2, potassium acetate and glycerol, and then with 3-[N-morpholino]-propane- 
sulphonic acid, RbCl and glycerol or by electroporation as for example described by Bio- 
Rad Laboratories, Richmond, California, USA, manufacturers of an electroporator. 
Mammalian cells in culture may be transformed by calcium co-precipitation of the vector 
DNA onto the cells or by using cationic liposomes. 

The invention also extends to a host cell transformed with a replicable expression 
vector of the invention. 

Culturing the transformed host cell under conditions permitting expression of the 
DNA polymer is carried out conventionally, as described in, for example, Sambrook et 
aL, and "DNA Cloning" cited above. Thus, preferably the cell is supplied with nutrient 
and cultured at a temperature below 45°C. 

The protein product is recovered by conventional methods according to the host 
cell. Thus, where the host cell is bacterial such as E. coli and the protein is expressed 
intracellular^, it may be lysed physically, chemically or enzymatically and the protein 
product isolated from the resulting lysate. Where the host cell is mammalian, the product 
is usually isolated from the nutrient medium. 

Althought the protein expressed by a plasmid encodes a specified single 
polypeptide post translational modification may lead to an altered product. During both 
the culture phase and the isolation phase the target protein is exposed to a number of 
processes that may have an affect on the precise nature of the polypeptide. For example, 
proteases present in the culture fluid may lead to limited proteolytic changes in the target 
protein. These changes might be internal to the polypeptide chain, leading to conversion 
from a one chain to a two (or more) chain molecule, or terminal where one or more 
residues at the N-terminus or the C-terminus of the molecule are removed by 
exopeptidase(s). One common example of the latter is the removal of the N-terminal 
methionine from proteins expressed in bacterial cell systems. The removal generally is 
only partial so that the final product will contain a mixture of two forms of the protein, 
one with and one without the methionine. The ratio of the two forms depends upon the 
host expression system, expression conditions and the polypeptide itself. 

Where the host cell is bacterial, such as E. coli, the product obtained from the 
culture may require folding for optimum functional activity. This is most likely if the 
protein is expressed as inclusion bodies. There are a number of aspects of the isolation 
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and folding process that are regarded as important. In particular, the polypeptide is 
preferably partially purified before folding, in order to minimise formation of aggregates 
with contaminating proteins and minimise misfolding of the polypeptide. Thus, the 
removal of contaminating £. coli proteins by specifically isolating the inclusion bodies 
and the subsequent additional purification prior to folding are important aspects of the 
procedure. 

The folding process is carried out in such a way as to minimise aggregation of 
intermediate-folded states of the polypeptide. Thus, careful consideration needs to be 
given to, among others, the salt type and concentration, temperature, protein 
concentration, redox buffer concentrations and duration of folding. The exact condition 
for any given polypeptide generally cannot be predicted and must be determined by 
experiment. 

There are numerous methods available for the folding of proteins from inclusion 
bodies and these are known to the skilled worker in this field. The methods generally 
involve breaking all the disulphide bonds in the inclusion body, for example with 50mM 
2-mercaptoethanol, in the presence of a high concentration of denaturant such as 8M urea 
or 6M guanidine hydrochloride. The next step is to remove these agents to allow folding 
of the proteins to occur. Formation of the disulphide bridges requires an oxidising 
environment and this may be provided in a number of ways, for example by air, or by 
incorporating a suitable redox system, for example a mixture of reduced and oxidised 
glutathione. 

Preferably, the inclusion body is solubilised using 8M urea, in the presence of 
mercaptoethanol, and protein is folded, after initial removal of contaminating proteins, by 
addition of cold buffer. A preferred buffer is 60mM ethanolamine containing ImM 
reduced glutathione and 0.5mM oxidised glutathione. The folding is preferably carried 
out at a temperature in the range 1 to 5°C over a period of 1 to 4 days. 

If any precipitation or aggregation is observed, the aggregated protein can be 
removed in a number of ways, for example by centrifugation or by treatment with 
precipitants such as ammonium sulphate. Where either of these procedures are adopted, 
monomeric polypeptide is the major soluble product. 

If the bacterial cell secretes the protein, folding is not usually necessary. 

Peptide linkages in the derivatives of the invention may be made chemically or 
biosynthetically by expression of appropriate coding DNA sequences. Non peptide 
linkages may be made chemically or enzymatically by post-translational modification. 
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The polypeptide portion of the derivatives of the invention may be prepared by 
expression in suitable hosts of modified genes encoding the soluble polypeptide of the 
invention plus one or more peptidic membrane binding elements and optional residues 
such as cysteine to introduce linking groups to facilitate post translation^ derivatisation 
with additional membrane binding elements. 

The polypeptide portion of the derivative of the invention may include a C- 
termmal cysteine to facilitate post translational modification. Expression in a bacterial 
system is preferred for proteins of moderate size (up to ~70kDa) and with <~8 disulphide 
bridges. More complex proteins for which a free terminal Cys could cause refolding or 
stability problems may require stable expression in mammalian cell lines (especially 
CHO). This may also be needed if a carbohydrate membrane binding element is to be 
introduced post-translationally. The use of insect cells infected with recombinant 
baculovirus encoding the polypeptide portion is also a useful general method for 
preparing more complex proteins and will be preferred when it is desired to carry out 
certain post-translational processes (such as palmitoylation) biosynthetically (see for 
example, M.J.Page et al J.Biol.Chem. 264, 19147-19154, 1989). 

A preferred method of handling proteins C-terrninally derivatised with cysteine is 
as a mixed disulphide with mercaptoethanol or glutathione or as the 2-nitro 5- 
carboxyphenyl thio- derivative as generally described below in Methods. 

Peptide membrane binding elements may be prepared using standard solid state 
synthesis such as the Merrifield method and this method can be adapted to incorporate 
required non-peptide membrane binding elements such as N-acyl groups derived from 
mynstic or palmitic acids at the N terminus of the peptide. In addition activation of an 
amino acid residue for subsequent linkage to a protein can be achieved during chemical 
synthesis of such membrane binding elements. Examples of such activations include 
formation of the mixed 2-pyridyl disulphide with a cysteine thiol or incorporation of an 
N-haloacetyl group. Peptides can optionally be prepared as the C-terminal amide 

-The derivatives of the invention may utilise a peptide membrane binding element 
comprising one or more derivatisations selected from: 

a terminal cysteine residue optionally activated at the thiol group; 
an N-haloacetyl group (where halo signifies chlorine, bromine or iodine) located 
at the N-terminus of the the peptide or at an e-amino group of a lysine residue; 
an amide group at the C-terminus; and 

a fatty acid N-acyl group at the N-terminus or at an e-amino group of a lysine 
residue. 
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Chemical bridging groups include those described in EP0109653 and EP0152736. 
The bridging group is generally of the formula: _ 

-A-R-B- (V) 

in which each of A and B, which may be the same or different, represents -CO-, 
-C(=NH2 + )~> rnaleimido, -S- or a bond and R is a bond or a linking group containing one 
or more -(CH2)- or meta- or para- disubstituted phenyl units. 

Where the polypeptide portion of the derivative of the invention and a peptidic 
membrane binding element both include a C-terminal cysteine the chemical bridging 
group will take the form -S-S-. The bridge is generated by conventional disulphide 
exchange chemistry, by activating a thiol on one polypeptide and reacting the activated 
thiol with a free thiol on the other polypeptide. Such activation procedures make use of 
disulphides which form stable thiolate anions upon cleavage of the S-S linkage and 
include reagents such as 2,2' dithiopyridine and 5,5'-dithio(2-nitrobenzoic acid, DTNB) 
which form intermediate mixed disulphides capable of further reaction with thiols to give 
stable disulphide linkages. 

R may include moieties which interact with water to maintain the water solubility 
of the linkage and suitable moieties include -CO-NH-, -CO-NMe-, -S-S-, -CH(OH)-, 
-S02-, -CO2-, -(CH2CH2-0) m - and -CH(COOH)- where m is an integer of 2 or more. 

Examples of R include -(CH 2 ) r , -(CH2) p -S-S-(CH 2 )q- and 
-(CH2)p-CH(OH)-CH(OH)-(CH 2 ) q -, in which r is an integer of at least 2, preferably at 
least 4 and p and q are independently integers of at least 2. 

The bridging group of formula (V) may be derived from a linking agent of 
formula (VI): 

X-R r Y (VI) 

in which R| is a linking group containing one or more -(CH2)- units and X and Y 
are functional groups reactable with surface amino acid groups, preferably a lysine or 
cysteine group, or the N-terminal amino group, or a protein attachment group. 

Preferred agents are those where X and Y are different, known as 
heterobifunctional agents. Each end of the agent molecule is reacted in turn with each 
polypeptide to be linked in separate reactions. Examples of heterobifunctional agents of 
formula (VI) include: 

N-succinimidyl 3-(2-pyridyldithio) propionate 

succinimidyl 4-(N-maleimido) caproate 

3-(2-pyridyl) methyl propionimidate hydrochloride 
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In each case Y incapable of reacting with a thiol group on a polypeptide, which 
may be a native thiol or one introduced as a protein attachment group. 

The protein attachment group is a functionality derived by modification of a 
polypeptide with a reagent specific for one or more amino acid side chains, and which 
contains a group capable of reacting with a cleavable section on the other polypeptide 
An example of a protein attachment group is a thiol group. An example of a cleavable 
section is a disulfide bond. Alternatively the cleavable section may comprise an a, B 
dihydroxy function. 

As an example, the introduction of a free thiol function by reaction of a 
polypeptide with 2-iminothiolane, N-succinimidyl 3-(2-pyridyldithio) propionate (with 
subsequent reduction) or N-acetyl homocysteine thiolactone will permit coupling of the 
protein attachment group with a thiol-reactive B structure. Alternatively, the protein 
attachment group can contain a thiol-reactive entity such as the 6-maleimidohexyl group 
or a 2-pyndyl-dithio group which can react with a free thiol in X. Preferably, the protein 
attachment group is derived from protein modifying agents such as 2-iminothiolane that 
react with lysine e-amino groups in proteins. 

When X represents a group capable of reacting directly with the amino acid side 
chain of a protein, it is preferably an N-succinimidyl group. When X represents a group 
capable of reacting with a protein attachment group, it is preferably a pyridylthio group. 

In the above processes, modification of a polypeptide to introduce a protein 
attachment group is preferably carried out in aqueous buffered media at a pH between 3 0 
and 9.0 depending on the reagent used. For a preferred reagent, 2-iminothiolane, the pH 
is preferably 6.5-8.5. The concentration of polypeptide is preferably high (> lOmg/ml) 
and the modifying reagent is used in a moderate (1.1- to 5-fold) molar excess, depending 
on the reactivity of the reagent. The temperature and duration of reaction are preferably 
in the range 0°-40° C and 10 minutes to 7 days. The extent of modification of the 
polypeptide may be determined by assaying for attachment groups introduced. 

Such assays may be standard protein chemical techniques such as titration with 
3,5'-dithiobis-(2-nitroben Z oic acid). Preferably, 0.5-3.0 moles of protein attachment 
group will be introduced on average per mole of polypeptide. The modified polypeptide 
may be separated from excess modifying agents by standard techniques such as dialysis 
ultrafiltration, gel filtration and solvent or salt precipitation. The intermediate material ' 
may be stored in frozen solution or lyophilised. 
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Where a protein attachment group is introduced in this way, the bridging group 
(V) will be formed from a reaction of the linking agent (VI) and the protein attachment 
group. 

The polypeptides to be linked are reacted separately with the linking agent or the 
reagent for introducing a protein attachment group by typically adding an excess of the 
reagent to the polypeptide, usually in a neutral or moderately alkaline buffer, and after 
reaction removing low molecular weight materials by gel filtration or dialysis. The 
precise conditions of pH, temperature, buffer and reaction time will depend on the nature 
of the reagent used and the polypeptide to be modified. The polypeptide linkage reaction 
is preferably carried out by mixing the modified polypeptides in neutral buffer in an 
equimolar ratio. Other reaction conditions e.g. time and temperature, should be chosen to 
obtain the desired degree of linkage. If thiol exchange reactions are involved, the 
reaction should preferably be carried out under an atmosphere of nitrogen. Preferably, 
UV-active products are produced (eg from the release of pyridine 2-thione from 2- 
pyridyl dithio derivatives) so that coupling can be monitored. 

After the linkage reaction, the polypeptide conjugate can be isolated by a number 
of chromatographic procedures such as gel filtration, ion-exchange chromatography, 
affinity chromatography or hydrophobic interaction chromatography. These procedures 
my be either low pressure or high performance variants. 

The conjugate may be characterised by a number of techniques including low 
pressure or high performance gel filtration, SDS polyacrylamide gel electrophoresis or 
isoelectric focussing. 

Membrane binding elements which are fatty acid derivatives are attached post 
translationally to a peptidic membrane binding element, preferably at the terminus of the 
polypetide chain. Preferably, where the recombinant polypeptide portion of the 
derivative of the invention contains the peptidic membrane binding element, it has a 
unique cysteine for coupling to the fatty acid derivative. Where the recombinant 
polypeptide has a cysteine residue, a thiol-derivative of the fatty acid is added to the 
refolded recombinant protein at a late stage in purification (but not necessarily the final 
stage) and at a reagent concentration preferably below the critical micelle concentration. 
One of the fatty acid derivative and the recombinant peptide will have the thiol group 
activated as described above for thiol interchange reactions. The fatty acid derivative is 
preferably a fatty acyl derivative of an aminoC2-6 a l' <: ane thiol (optionally C-substituted) 
such as N-(2-myristoyl ) aminoethanethiol or N-myristoyl L-cysteine. 
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PEG of *„. weight baween m a „ d Wd^^^r^ 

seoarano,, of H , " and ca "° n change chromatography for 

separate of denvansed and underused proteins i„,o whi ch a hydrophobic 
efecrroaaac swtch combination has been inserted y°™phob,c- 

tG >«e portion of the 

DNA. " PreSS1 ° n VeCt ° rS and ^ombmant host cells containing the 

The polypeptide or derivative of this invention is useful i„ the treatment or 

Disease and Disorders Involving Complement 
Neurological Disorders 

multiple sclerosis 
stroke 

Guillain Barre Syndrome 
traumatic brain injury 
Parkinson's disease 
allergic encephalitis 
Alzheimer's disease 
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Disorders of Inappropriate or Undesirable Complement Activation 

haemodialysis complications _ 
hyperacute allograft rejection 
xenograft rejection 
corneal graft rejection 

interleukin-2 induced toxicity during IL-2 therapy 
paroxysmal nocturnal haemoglobinuria 

Inflammatory Disorders 

inflammation of autoimmune diseases 
Crohn's Disease 

adult respiratory distress syndrome 

thermal injury including burns or frostbite 

uveitis 

psoriasis 

asthma 

acute pancreatitis 
Kawasaki's disease 

Post-Ischemic Reperfusion Conditions 

myocardial infarction 
balloon angioplasty 

atherosclerosis (cholesterol-induced) & restenosis 
hypertension 

post-pump syndrome in cardiopulmonary bypass or renal haemodialysis 
renal ischemia 
intestinal ischaemia 

Infectious Diseases or Sepsis 
multiple organ failure 
septic shock 

Immune Complex Disorders and Autoimmune Diseases 

rheumatoid arthritis 

systemic lupus erythematosus (SLE) 
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SLE nephritis ~~~~ 
proliferative nephritis 
glomerulonephritis 
haemolytic anemia 
myasthenia gravis 

Reproductive Disorders 

antibody- or complement-mediated infertility 

Wound Healing 



a the™ ^ • Pr r n I; nVenti0n iS aIs ° directed *> a Pharmaceutical composition comprising 
a therapeutically effective amount of a polypeptide or derivative, as above, and a 
pharmaceutical^ acceptable carrier or excipient. 

The present invention also provides a method of treating a disease or disorder 
assorted with inflammation or inappropriate complement activation comprising 
admimstering to a subject in need of such treatment a therapeutically effective amount of 
a polypeptide or derivative of this invention. 

In the above methods, the subject is preferably a human 

or d' « T effeC t e am ° Um ° f P ° lypeptide ° r Native for the treatment of a disease 
or disorder is in the dose range of 0.01-1 OOmg/kg; preferably 0.1mg-10mg/k g 

For administration, the polypeptide or derivative should be formulated into an 
appropriate pharmaceutical or therapeutic composition. Such a composition typically 
contains a therapeutically active amount of the polypeptide or derivative and a 
pharmaceutical^ acceptable excipient or carrier such as saline, buffered saline, dextrose 
or water. Compositions may also comprise specific stabilising agents such as sugars ' 
-eluding mannose and mannitol, and local anaesthetics for injectable compositions ' 
including, for example, lidocaine. 

Further provided is the use of a polypeptide or derivative of this invention in the 
manufacture of a medicament for the treatment of a disease or disorder associated with 
inflammation or inappropriate complement activation. 

In order to inhibit complement activation and, at the same time, provide 
thrombolytic therapy, the present invention provides compositions which further 
comprise a therapeutically active amount of a thrombolytic agent. An effective amount 
of a thrombolytic agent is in the dose range of O.OMOmg/kg; preferably 0.1-5mg/kg 
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Preferred thrombolytic agents include, but are not limited to, streptokinase, human tissue 
type plasminogen activator and urokinase molecules and derivatives, fragments or 
conjugates thereof. The thrombolytic agents may comprise one or more chains that may 
be fused or reversibly linked to other agents to form hybrid molecules (EP-A-0297882 
and EP 155387), such as, for example, urokinase linked to plasmin (EP-A-0152736), a 
fibrinolytic enzyme linked to a water-soluble polymer (EP-A-0183503). The 
thrombolytic agents may also comprise muteins of plasminogen activators (EP-A- 
0207589). In a preferred embodiment, the thrombolytic agent may comprise a reversibly 
blocked in vitro fibrinolytic enzyme as described in U.S. Patent No. 4,285,932. A most 
preferred enzyme is a p-anisoyl plasminogen-streptokinase activator complex as 
described in U.S. Patent No. 4,808,405, and marketed by SmithKline Beecham 
Pharmaceuticals under the Trademark EMINASE (generic name anistreplase, also 
referred to as APS AC; Monk et al, 1987, Drugs 34:25-49). 

Routes of administration for the individual or combined therapeutic compositions 
of the present invention include standard routes, such as, for example, intravenous 
infusion or bolus injection. Active complement inhibitors and thrombolytic agents may 
be administered together or sequentially, in any order. 

The present invention also provides a method for treating a thrombotic condition, 
in particular acute myocardial infarction, in a human or non-human animal. This method 
comprises administering to a human or animal in need of this treatment an effective 
amount of a polypeptide or derivative according to this invention and an effective amount 
of a thrombolytic agent. 

Also provided is the use of a polypeptide or derivative of this invention and a 
thrombolytic agent in the manufacture of a medicament for the treatment of a thrombotic 
condition in a human or animal. Such methods and uses may be carried out as described 
in WO 91/05047. 

This invention further provides a method for treating adult respiratory distress 
syndrome (ARDS) in a human or non-human animal. This method comprises 
administering to the patient an effective amount of a polypeptide or derivative according 
to this invention. 

The invention also provides a method of delaying hyperacute allograft or 
hyperacute xenograft rejection in a human or non-human animal which receives a 
transplant by administering an effective amount of a polypeptide or derivative according 
to this invention. Such administration may be to the patient or by application to the 
transplant prior to implantation. 
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The invention yet further provides a method of treating wounds in a human or 
non-human animal by administering by either topical or parenteral e.g. intravenous 
routes, an effective amount of a polypeptide or derivative according to this invention 



26 



P31767 



GENERAL METHODS USED IN EXAMPLES 

(i) DNA Cleavage 

Cleavage of DNA by restriction endonucleases was carried out according to the 
manufacturers instructions using supplied buffers. Double digests were carried out 
simultaneously if the buffer conditions were suitable for both enzymes. Otherwise 
double digests were carried out sequentially where the enzyme requiring the lowest salt . 
condition was added first to the digest. Once the digest was complete the salt 
concentration was altered and the second enzyme added. 

(ii) DNA ligation 

Ligations were carried out using T4 DNA lig-ase purchased from Prornega. as 
described in Sambrook et al, (1989) Molecular Cloning: A Laboratory Manual 2nd 
Edition. Cold Spring Harbour Laboratory Press. 

(in) Plasmid isolation 

Plasmid isolation was using Prornega Wizard^M Plus Minipreps or Qiagen 
Plasmid Maxi kit according to the manufacturer's instructions. 

(iv) DNA fragment isolation 

DNA fragments were excised from agarose gels and DNA extracted using the 
QIAEX gel extraction kit or Qiaquick, or GeneClean gel extraction kits according to the 
manufacturer's instructions (QIAGEN Inc., USA, Bio 101 Inc, USA). 

(v) Introduction of DNA into E. coli 

Plasmids were transformed into E. coli BL21(DE3) (Studier and Moffat, (1986), 
J. MoL Biol 189:1 13) that had been made competent using calcium chloride as described 
in Sambrook et al, (1989). E. coli JM109 and XLl-blue strains were purchased as a 
frozen competent culture from Prornega. 

(vi) DNA sequencing 

Plasmid DNA is sequenced on a Vistra DNA Labstation 625. The sequencing 
chemistry is performed using Amersham International's Thermo Sequenase fluorescent 
dye- terminator cycle sequencing kit' (RPN 2435), in conjunction with their 'FMP 



27 



P31767 



fluorescent dye-terminator precipitation kit' (RPN 2433) according to the manufacturer's 

instructions. 

a T5 t SCqUenCeS Pr<XlUCed 5y thB abOVC pr ° Cedure « analysed by a Perkin Elmer 

Pmm 377 DNA Sequencer. This is an electrophoretic technique using 36 cm x 
02mm 4% acrylamide gels, the fluorescently labeled DNA fragments being detected by a 
charge coupled device camera according to the manufacturer's instructions. 

(vii) Production of oligonucleotides 

Oligonucleotides were purchased from Cruachem. 

(viii) pBROC413 

The plasmid P T7-7 [Tabor, S (1990), Current Protocols in Molecular Biology F 
A. Ausubel, Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, and K ' 
Struhl.eds.] pp.16.2.1-16.2.1 1, Greene Publishing and Wiley-Interscience.New York ] 
contains DNA corresponding to nucleotides 2065-4362 of pBR322 and like pBR322 can 
be mobilized by a conjugate plasmid in the presence of a third plasmid ColK A 
mobility protein encoded by ColK acts on the nic site at nucleotide 2254 of pBR322 
initiating mobilization from this point. pT7-7 was digested with Lspl and Bglll and the 
protruding 5' ends filled in with the Klenow fragment of DNA Polymerasel The 
plasmid DNA fragment was purified by agarose gel electrophoresis, the blunt ends 
hgated together and transformed into E. coli DH1 by electroporation using a Bio-Rad 
Gene Pulser and following the manufacturers recommended conditions. The resultant 
plasmid PBROC413 was identified by restriction enzyme analysis of plasmid DNA 

The deletion in pBROC413 from the Lspl site immediately upstream of the f 10 
promoter to the BglU site at nucleotide 434 of P T7-7 deletes the DNA corresponding to 
nucleotides 2065-2297 of P BR322. The nic site and adjacent sequences are therefore 
deleted making pBROC413 non mobilizable. 

(ix) Sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS PAGE) 

SDS PAGE was carried out generally using the Novex system (British 
Biotechnology) according to the manufacturer's instructions. Prepacked *els of 
acrylam.de concentrations 4 - 20% were most frequently used. Samplesfor 
electrophoresis, including protein molecular weight standards (for example LMW Kit 
Pharmacia or Novex Mark 12) were usually diluted in l%(w/v)SDS - containing buffer 



28 



P31767 



(with or without 5%(v/v) 2-mercaptoethanol), and left at room temperature for about 0.5 
h before application to the gel. _ 

(x) Reduction of disulphides and modification of thiols in proteins 

There are a number of methods used for achieving the title goals. The reasons it 
may be necessary to carry out selective reduction of disulphides is that during refolding, 
concentration and further purification of multi-thiol proteins inappropriate disulphide 
pairing can occur. In addition, even if correct disulphide paring does occur, it is possible 
that a free cysteine in the protein may become blocked with the reducing agent, for 
example glutathione. These derivatives are generally quite stable. In order to make 
them more reactive, for example for subsequent conjugation to another functional group, 
they need to be selectively reduced, with for example dithiothreitol (DTT) or with Tris 
(2-carboxyethyl) phosphine.HCl (TCEP) then optionally modified with a function which 
is moderately unstable. An example of the latter is Ellmans reagent (DTNB) which 
gives a mixed disulphide. In the case where treatment with DTNB is omitted, careful 
attention to experimental design is necessary to ensure that dimerisation of the free thiol- 
containing protein is minimised. Reference to the term 'selectively reduced* above 
means that reaction conditions eg. duration, temperature, molar ratios of reactants have to 
be carefully controlled so that disulphide bridges within the natural architecture of the 
protein are not reduced. All the reagents are commercially available eg. from Sigma or 
Pierce. 

The following general examples illustrate the type of conditions that may be used 
and that are useful for the generation of free thiols and their optional modification. The 
specific reaction conditions to achieve optimal thiol reduction and/or modification are 
ideally determined for each protein batch. 

TCEP may be prepared as a 20mM solution in 50mM Hepes (approx. pH 4.5) and 
may be stored at -40 degrees C. DTT may be prepared at lOmM in sodium phosphate 
pH 7.0 and may be stored at -40 degrees C. DTNB may be prepared at lOmM in sodium 
phosphate pH 7.0 and may be stored at -40 degrees C. All of the above reagents are 
typically used at molar equivalence or molar excess, the precise concentrations ideally 
identified experimentally. The duration and the temperature of the reaction are similarly 
determined experimentally. Generally the duration would be in the range 1 to 24 hours 
and the temperature would be in the range 2 to 30 degrees C. Excess reagent may be 
conveniently removed by buffer exchange, for example using Sephadex G25. A suitable 
buffer is 0. 1M sodium phosphate pH7.0. 
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EXAMPLES 

Example 1 Expression and isolation of CM7 (SEQ ID 18) 

(a) Construction of plasmid P BrocSCRl-3CM7 encoding CM7 

The plasmid coding for SCR1-3 (MQl -> K196) of CR-1, pDB1013-5 (patent 
apphcanon WO 94/00571) was subjected to site directed mutagenesis using a 
QuickChange kit suppled by Sfratagene. Three p airs of oligonucleotides were utilised to 
introduce ten amino acid changes changes to the native SCR3 sequence corresponding to 

l^r 5 TIT, ^ ^ CR " ! ^ PSCUd0gene SCqUenCe dCSCribed b ' de Ll 

1990 (Journal of Biological Chemistry 265, pp 974-980): Each pair of oligonucleotides 

is complementary in sequence. Changes are shown in lower case. 

The first pair: 

«Fmn A Mn^ CCAACGGTGAmCAcCTCTAtCA8TC0CGAG ' A ™AC 

(SEQ ID NO l)and 

SQ A m A NO T 2) CTCGCGACTOaTAGAGSTO 

resulted in five amino acid changes and loss of an Apol restriction site. 

The second pair: 

GACCTACCaCTGCAATCtGGGTAGCcGTGGTaaaAAGGTGTTTGAGC 
(SEQ ID NO 3) and 

GCTCAAACACCTTtttACCACgGCTACCCaGATTGCAGtGGTAGGTC 
(SEQ ID NO 4) 

resulted in four amino acid changes and the acquisition of a BsaJI restriction site. 
The third pair: 

GCACTAGcAAaGACGATCAAGTGGG (SEQ ID NO 5) and 
CCCACTTGATCGTCt TTgCTAGTGC (SEQ ID NO 6) 
resulted in a single amino acid change and the loss of a Spel restriction site 
To generate DNA encoding CM7 all six oligonucleotides were used 
simultaneously in the mutagenesis reaction and transformed into competent XLl-Blue E 
coU (Stratagene). Resulting colonies were grown up in LBroth and plasmids extracted ' 
using standard methodology. The plasmids were screened for sucessful mutagenesis by 
the loss of the Apol or Spel restriction sites or the acquisition of a new BsaJI restriction 
sue. In the first experiment only the oligonucleotide pair (SEQ ID NOs 5 and 6) were 
incited resulting in the loss of the Spel restriction site. This plasmid (pBrocSCRl- 
3P3) was subjected to further rounds of site-directed mutagenesis usin- the 
oligonucleotides SEQ ID NOs 1, 2, 3 and 4. From this was produced the mutated 
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plasmid pBrocSCl-3P7 containing all ten amino acid coding changes in the SCR3 coding 
domain. 

Using the restriction enzymes EcoRI and Hindlll, the mutated SCR3 domain was 
excised from the plasmid pBrocSCRl-3P7 as a 229 base pair fragment and ligated back 
into a 2540 base pair EcoRI/Hindlll fragment of pDB1031-5 containing vector and 
SCR 1-2 sequences to minimise the possibility of unwanted mutations elsewhere in the 
plasmid. The resulting plasmid was pBrocSCRl-3CM7 encoding the protein CM7 (SEQ 
ID NO: 18). 

(b) Expression of CM7 from pBrocSCRl-3CM7 

pBrocSCRl-3CM7 was transformed into calcium chloride competent E. coli 
BL21(DE3) and resultant colonies were isolated and checked for plasmid content. To 
express protein from pBrocSCRl-3CM7 in £. coli BL21(DE3), a single colony was 
inoculated into 10 ml LB-phosphate media (20g/L tryptone, 15g/L yeast extract, 0.8g/L 
NaCl, 0.2g/L Na 2 HP0 4) 0.1 g/L KH 2 P0 4 ) containing 50ug/ml ampicillin. The culture 
was grown for 6 hours at 37°C, 230 r.p.m. before being used to inoculate 100 ml of the 
same media containing 50 ug/ml ampicillin. Growth was under the same conditions 
overnight. 25 ml of each culture were then used to inoculate 600 ml of the same media 
with 50 ug/ml ampicillin in 3 L erlenmeyer flasks. Cells were grown to an OD of about 
0.25 at A6Q0 nm * IPTG (isopropyl B-D galactopyranoside) was added to a final 
concentration of 1 mM and cells allowed to continue growth for a further about 8 hours 
before harvesting by centrifugation at 8000 g/10 min. Pellet from 2L of culture was 
stored at -40°C 

(c) Isolation, refolding, purification and formulation of CM7 

The methods described are essentially those detailed in Dodd I. et al (1995) 
Protein Expression and Purification 6 727-736 with some modifications, 
(i) Isolation of solubilised inclusion bodies 

The frozen cell pellet of E. coli BL21(DE3) (pBrocSCRl-3CM7) was allowed to 
thaw at. room temperature for 2h and resuspended in 50 mM Tris/50 mM NaCl/1 mM 
EDTA pH 8.0 (approx. 60ml). The suspension was transferred to a glass beaker 
surrounded by ice and sonicated (Heat systems - Ultrasonics W380; 50 x 50% pulse, 
pulse time = 5 sec.) for 3 min. and then spun at 7000 rpm for 20 min. The supernatant 
was decanted and discarded The pellet was resuspended in 20 mM Tris/8M urea/1 mM 
EDTA/50 mM 2-mercaptoethanol pH 8.5 (80ml) at room temperature by vigorous 
swirling, then left for Ih at room temperature with occasional swirling. 
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(ii) Initial purification using SP-Sepharose 

To the viscous solution was added SP-Sepharose FF (30g wet weight) that had 
been water washed and suction-dried. The mixture was swirled vigorously and left static 
for l-2h at room temperature. The supernatant was decanted, sampled and discarded. 
The remaining slurry was resuspended to a uniform suspension and poured into a glass 
jacket and allowed to settle into a packed bed. The column was equilibrated with 0.02M 
Tris/8M urea/0.05M 2-mercaptoethanol/0.001 M EDTA pH 8.5. at room temperature. 
When the A28O °f t ^ ie eluate had stabilised at baseline, the buffer was changed to 
equilibration buffer additionally containing 1M NaCl. A single A28O peak was eluted 
by the 1M NaCl-containing buffer; the volume was approx. 40ml. The protein 
concentration of a sample of this solution that had been buffer-exchanged (Sephadex 
G25) into 50 mM formic acid was estimated by A2gQ determination, using a molar 
extinction coefficient of 25000 cm' 1 and the formula A = ECL where A is the absorption 
of the solution at 280nm, E is the molar extinction coefficient, C is the molar 
concentration of protein and L is the light path in cm. This showed the product had a 
protein concentration of 0.44mg/ml. SDS PAGE showed that the product contained 
apparently two major species, both wih a molecular weight around 22 000. The solution 
was stored at -40°C. 

(iii) Folding and further processing 

The 40ml SP-Sepharose-purified product was added gradually over a 1 min 
period to 1240 ml freshly prepared, cold 0.06 M ethanolamine/lmM EDTA with 
continuous swirling, and left static for 1 h/4°C Reduced glutathione (GSH) was added 
to 1 mM and oxidised glutathione (GSSG) was added to 0.5 mM by the addition of 100- 
fold concentrates of both solutions. The solution was clear and was left static approx 2- 
3°C for 5d. The solution was then ultrafiltered using a YM10 membrane to a final 
retentate volume of 59 ml; the retentate was clear and it was mixed with 9 vol. 0.1M 
NaH2P04/lM (NH4>2S04 pH 7.0 (Buffer A) at room temperature and immediately 
centrifuged at 3000 rpm for 20 min. The supernatant was applied to a column 
(i.d. ,26mm, h. ,100mm) of Butyl Toyopearl 650M and the column was developed using a 
linear gradient of Buffer A to 0.1 M sodium phosphate pH7.0. All the chromotography 
was at room temperature at 2 ml / min. A single A280-peak was noted during the 
gradient. The fractions spanning the peak were pooled and were ultrafiltered (YM10) to 
2.5ml. This retentate was regarded as the product. The product contained one major 
species by non-reduced SDS PAGE with an estimated purity of >95% and an apparent 
molecular weight of 21 000. A sample of the product diluted 10-fold in 0.1M sodium 
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phosphate pH7.0 had an A280 of 0.47; using an extinction coefficient of 34000 allowed 
the proton concentration to be calculated as 140^M. CIS Poros HPLC using a 

approx. 99%. Electrcspray mass spectrophotometry gave a mass of 21887. 

Example 2 Construction of plasmids pBrocSCRl-3CMl 2 i * < 
CM1, CM2, CM3, CMS and CM6 ' ' * 6 enC ° ding 

TTie plasmids pBrocSCRl-SPl, 2, 3, 5 and 6 were generated by site directed 

pBrocSCRl-3Pl was made using only SEQ ID NOs 1 and 2 
pBrocSCRl-3P2 was made using only SEQ ID NOs 3 and 4 
P BrocSCRl-3P3 was made using only SEQ ID NOs 5 and 6 
P BrocSCRl-3P5 was made using SEQ ID NOs 3 to 6 and 
pBrocSCRl-3P6 was made using SEQ ID NOs 1, 2, 5 and 6 

Plasmids pBrocSCRl-3CMl o t. k ^ 

v ^ki JLM1, 2, 3, 5 and 6 were generated by excision of 2^9 base nair 
fragments encoding the mutated srci^ • c , "^-^oasepair 
pBrocSCRI ^ TL 7 PlaSmidS P BrocS CRl-3Pl to 

eLTi^ dr" n f " a§mentS ^ ligat6d ^ int ° a 2540 ^se pair 

EcoRI/HmdIII fragment of pDB1013-5 containing vector and SCR1-2 sequences to 
_ the possibility of unwanted mutations elsewhere in the plasmi^ " 
plasmids were P BrocSCRl-3CMl, 2, 3, 5 and, encode the proteins CM1 CM2 CM, 
CM5 and CM6 (SEQ ID NOs 8, 10, 12, 14 and 16. resp^tivSy) 

CM1.CM2.CM3. CMS and CM6 were expressed in £. using methods 

PAGEfoHo d b eSCribed ^ EX3mPle 1 " AnaIySiS ° f ^ ^ SDS 

~ o7a \ Stami : S Pr0tem Sh ° Wed that ^ IPTG — ^ » ^d 
expression of a polypeptide with a molecular weight of about 22 000, which was 

-distinguishable from CM7 analysed in the same way. The whole cell pel Jmay be 
stored at -40 degrees C or below until required. y 

CM7 Jobt ^ COli r h Pdlet ^ Pr ° CeSSed " 6XaCtly the Same ^ as descri ^d for 
t0 ° btain P Unfied ' concentrated protein for further evaluation. 
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Example 3 Construction of plasmids pBrocSCR3CM8, 9, 10, 12, 13 and 14 
encoding CM8, CM9, CM10, CM12, CM13 and CMW^SEQ ID NOs: 20, 22, 24, 26, 
28 and 30) 

The plasmids pBrocSCR3CM8, 9, 10, 12, 13 and 14were generated from the 
plasmids pBrocSCRl-3Pl, 2, 3, 5, 6 and 7 and the plasmid pBroc435 (patent application 
WO 94/00571). Digestion of pBrocSCrl-3Pl, 2, 3, 5, 6 and 7 with the restriction 
enzymes EcoRI and Hindlll yielded fragments of 229 base pairs encoding the mutated 
SCR3 domains from pBrocSCRl-3Pl, 2, 3, 5, 6 and 7, and these were ligated with a 
EcoRI/Hindlll fragment of 2171 base pairs from pBroc435 containing vector and 
promoter sequences. The resulting plasmids were pBrocSCR3CM8, 9, 10, 12, 13 and 14, 
encoding the proteins CM8, CM9, CM10, CM12, CM13 and CM14 (SEQ ID NOs: 20, 
22, 24, 26, 28 and 30, respectively), i.e., mutant SCR3 domains identical in sequence to 
the SCR3 domains of CM1, CM2, CM3, CMS, CM6 and CM7. 

CM8, CM9, CM10, CM12, CM13 and CM14 may be expressed in E. coli using 
methods similar to those described for CM7 in Example 1. The whole cell pellet may 
be stored at -40 degrees C or below until required. 

The E. coli cell pellet can be processed in a similar way to that described for CM7 
cell pellet to obtain purified, concentrated protein for further evaluation. More 
preferably, modifications may be made to the isolation protocol, for example those 
described in Clark, N.S, 1996 (Ph. D. Thesis, Southampton University), 

Example 4 Expression and isolation of CM7/cys (SEQ ID NO 31) 

(i) Construction of plasmid pDB1030 encoding [SCR l-3]-Cys (SEQ ID NO 36) 

The plasmid coding for SCR 1 -3 of LHR-A of CR 1 , pDB 1013-5 (patent 
application WO 94/00571) was digested with restriction endonucleases EcoKl and 
Hindlll and the 2.2 kB plasmid band was isolated from an agarose gel using a Qiagen 
Qiaex DNA extraction kit according to the manufacturer's instructions. This is fragment 
1. A second batch of pDB1013-5 was digested with Banl and EcoRI and the 196 bp 
band was extracted from agarose as above. This is fragment 2. Two oligonucleotides, 
SEQ ID NO. 34 and , SEQ ID NO. 35, were annealed to give a final DNA concentration 
of 100 pmoles/ul. The annealed oligo has a Banl/EcoRl overhang and duplicates the 
sequence at the 3' end of pDB 1 0 1 3-5 but in addition contains a codon coding for cysteine 
just before the stop codon. This is fragment 3. 
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RDB1O30. The l.gated plasmid was .ransformed imo cc.mpe.en, E. coli JM109 
purchased frorn Promega. Resting colonies were ana.ys'ed by resiricdon endonuclease 

SCR(I-3) (SEQ ID No.27 of WO 94/00571) had been altered by a single C-terminal 

cysteine residue to give SEQ ID NO.36. 

(ii) Title compound 

CM7/cys is prepared as follows. The plasmid encoding CM7 described in 

488 a Z PB ;° CSCR1 - 3CM7 ' is Cleaved with the restriction enzyme BanI to generate a 
488 base P a lr fragment encoding the central region of SCR1-3 including the entire 

™ ^J 10 " SamC enZyme iS USed t0 Cleave the P^smid PDB1030 encoding 
LbCRl-3]-Cys from (i); by incomplete digestion this yields a 228 1 base pair fragment 
comprising the vector sequence, the start of the SCR1-3 gene and the C terminus of 
SCR1-3 including the nucleotides encoding the carboxy-terminal cysteine residue 
Ligation of the 488 base pair fragment from P BrocSCRl-3CM7 and the 2281 base pair 
fragment from P DB1030 in the correct orientation yields P BrocSCRl-3CM7/c V s 
encoding the protein CM7/cys (SEQ ID NO 31). Alternative cloning strategies to the 
same end point are also possible. 

rvr 7 • ? 17/C " S 15 eXPreSSCd in ^ USing meth ° dS identical to ^ described for 

" EXampIe L ThC Wh ° le cel1 -ay be stored at -40 degrees C or below until 

required. 

The E. coli cell pellet is processed in exactly the same way as described for CM7 
to obtain purified, concentrated protein for further evaluation. De P endin a upon the 
conditions used, the C-terminal cysteine of the protein in the final prep™ may be 
blocked with glutathione. 

Example 5 Preparation of [CM7]-Cys-S-S-[MS WP-1] (PM-9) (SEQ ID No. 32) 

H 2 N-[CM7]-Cys-OH 



S 

I 

S 



N ^- to yl).G*^ 

(i) Myristoyl/Electrostatic Switch Peptide Reagent 1 (MSWP-1) 
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N-(Myristoyl) -GIy-Ser-Ser-Lys-Ser-Pro-Ser-Lys-Lys-Lys-Lys-Lys-Lys-Pro-Gly-Asp-(S-2- 
ThiopyridyI)Cys-NH 2 

The peptide: 

Gly-Ser-Ser-Lys-Ser-Pro-Ser-Lys-Lys-Lys-Lys-Lys-Lys-Pro-Gly-Asp-Cys-NH2 
(SEQ ID NO 33) 

was prepared using solid phase synthesis via the general Fmoc/tBu strategy developed by 
Sheppard and Atherton (E.Atherton and R.C.Sheppard, Solid Phase Synthesis, IRL Press, 
Oxford, 1989). Kieselguhr-supported polydimethylacrylamide resin (Macrosorb 100) 
was used as the solid support and was derivatised with ethylene diamine. 

Coupling reactions were carried out using N-a-Fmoc protected reagents pre- 
activated with N,N , -diisGprGpylcarbodiimide/ N-hydroxybenzotriazole (in 4-fold molar 
excess) with bromophenol blue monitoring. Fmoc cleavages used 20% piperidine in 
DMF. Reactions to assemble the peptide chain were carried out by repeated cycles of 
coupling and deprotection including the attachment of the modified Rink linkage reagent 
(p.-[(R,S)-a-[l-(9H-fluoreny-9-yl-methoxyformamido ] 2,4 dimethoxybenzyl]- 
phenoxyacetic acid) designed to yield a C-terminal amide on final cleavage. The side 
chain functionalities of the individual amino-acids were protected as follows: 
Ser (tButyl), Lys (Boc), Asp (O-tButyl), Cys (Trityl). 

On completion of the peptide assembly and with the peptide still attached to the 
resin, the myristoyl group was attached to the amino group of the N terminal glycine by 
direct coupling of myristic acid by the same activation procedure. This modified peptide 
was then cleaved from the resin and the side-chain protecting groups removed at the 
same time by treatment with trifluoracetic acid containing 2.5% water and 2.5% 
triisopropyl silane. 

The crude product was treated with 2,2' dithiopyridine in 0.01M ammonium 
acetate solution at pH 8-9 for approx. 2h, then acidified with acetic acid and purified by 
preparative high performance liquid chromatography (HPLC) in 0.1% trifluoracetic acid 
(TFA) /water and 0.1% TFA/acetonitrile as gradient component. After lyophilisation, the 
peptide was a white amorphous powder, soluble to at least lOmg/ml in 
dimethylsulphoxide. Fast atom bombardment mass spectrometry gave main peaks at m/e 
2107.8, 2129.7 and 2145.8, corresponding to the monoprotonated, monosodiated and 
monopotassiated molecular ions of the peptide. The 2-thiopyridyl content of the peptide 
was measured by dissolving it to around 0.03mM to 0.2 mM in 0.1 M Sodium Borate pH 
8.0 and reducing by addition of dithiothreitol to 5mM. The change in optical density at 
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343nm was used to calculate the amount of pyridine 2-thione released using an 
extinction coefficient at this wavelength of 8080 cm" M" . This indicated that the peptide 
content was approximately 60% of the dry weight, 
(ii) Title compound 

CM7/cys from Example 4 (50^M; 1.0ml) is mixed with TCEP (Tris-(2- 
carboxyethyl) phosphine ) (20mM; 0.01ml) and incubated at room temperature for 22 h 
0.1ml of 50mM ethanolamine is added and mixed. MSWP-1 from (i) (20mM in 0 1M 
sodium phosphate pH 7.0; 0.0125ml) is added and the solution incubated for a further 4 
h. The solution is then chromatographed on a Sephadex G50 column; suitable buffers 
for the chromatography are 50mM formic acid or 50mM ammonium bicarbonate from 
which the product may be conveniently lyophilised, or phosphate-buffered saline The 
fractions containing the PM-9 may be identified in a number of ways, for example by 
SDS PAGE. These fractions can then be pooled, concentrated if appropriate filtered 
through a 0.22^ membrane if necessary, divided into aliquots as required and then stored 
at - 40 degrees C or lyophilised. 

Example 6 

Expression and isolation of protein CM15 (SEQ ID NO:37) 

CM15 comprises SCR1-3 (MQ1->K196) of CR1 (WO94/0057 1 ) with all the 
amino acid changes corresponding to the CR-1 like pseudogene sequence described by 
Hourcade et al 1990 (Journal of Biological Chemistry 265, pp 974-980) where they 
appear in the region homologous to SCR1-3 (13 amino acid changes in addition to the 10 
changes in CM7). The plasmid P BrocSCRl-3CM15 is generated by site-directed 
mutagenesis of P BrocSCRl-3CM7. Oligonucleotides 20 to 100 bases in length are 
synthesised spanning the regions of change on both strands, in which minimal base 
changes have been introduced to alter the amino acid sequence and generate or delete 
restriction sites for diagnostic purposes.. P BrocSCRl-3CM15 is constructed using 
these oligonucleotides and methodology similar to that used to construct pBrocSCRl- 
3CM7, as described in Example 1 a ). Alternative mutagenesis or cloning procedures are 
feasible. 

CM15 is expressed from P BrocSCRl-3CM15 in E. coli using methods identical 
to those described for CM7 in Example 1. The whole cell pellet may be stored at -40 
degrees C or below until required. 

The E. coli cell pellet is processed in exactly the same way as described for CM7 
to obtain punfied, concentrated CM 15 protein for further evaluation. 
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Example 7 

Expression and isolation of protein CM15/cys (SEQ ID NO:38) 

CM15/cys comprises CM15 (Example 6) with an additional C-terminal cysteine 
residue. Plasmid pBrocSCRl-3CM15/cys is generated by site-directed mutagenesis of 
pBrocSCRl-3CM7/cys (Example 4). Oligonucleotides 20 to 100 bases in length are 
synthesised spanning the regions of change on both strands, in which minimal base 
changes have been introduced to alter the amino acid sequence and generate or delete 
restriction sites for diagnostic purposes. pBrocSCRl-3CM15/cys is constructed using 
these oligonucleotides and methodology similar to that used to construct pBrocSCRl- 
3CM7, as described in Example la). Alternative mutagenesis or cloning procedures are 
feasible. 

CM15/cys protein is expressed from pBrocSCRl-3CM 15/cys in E. coli using 
methods identical to those described for CM7 in Example 1. The whole cell pellet may 
be stored at -40 degrees C or below until required. 

The E, coli cell pellet is processed in exactly the same way as described for CM7 
to obtain purified, concentrated CM1 15/cys protein for further evaluation. Depending 
upon the conditions used, the C-terminal cysteine of the protein in the final preparation 
may be blocked with glutathione. 

Example 8 Preparation of [CM15]-Cys-S-S-[MSWP-1] (SEQ ID NO:39) 

H 2 N-[CM15]-C>s-OH 



N-(myristoyl)-Gly-Ser-Ser-Lys-Ser-^ 

The title compound is prepared from CM 15/cys by procedures described in Example 5. 
Biological Activity 

Anti-complement Activity Measured by the Haemolysis of Sheep Erythrocytes 

Functional activity of complement inhibitors was assessed by measuring the 
inhibition of complement-mediated lysis of sheep erythrocytes sensitised with rabbit 
antibodies (Diamedix Corporation, Miami, USA). Human serum diluted 1:125 or 1:100 
in 0.1 M Hepes/0.15 M NaCl/0.1% gelatin pH 7.4 was used as a source of complement. 
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The serum was prepared from a pool of volunteers essentially as described in Dacie & 
Lewis, 1975. Briefly, blood was warmed to 37<>C for 5 minutes, the clot removed and 
the remaining serum clarified by centrifugation. The serum fraction was split into small 
ahquots and stored at - 1960 C . Aliquots were thawed as required and diluted in the 
Hepes buffer immediately before use. 

Inhibition of complement-mediated lysis of sensitised sheep erythrocytes was 
measured using a standard haemolytic assay using a v-bottom microtitre plate format as 
follows:. 

50 ^1 of a range of concentrations of inhibitor (typically in the region of 0.1 - 100 
nM) diluted in Hepes buffer were mixed with 50 ul of the diluted serum and 100 ul of 
prewarmed sensitised sheep erythrocytes and then incubated for 1 hour at 37<>C. Samples 
were spun at 1600rpm at ambient temperature for 3 minutes before transferring 150 uJ of 
supernatant to flat bottom microtitre plates and determining the absorption at 410 nm 
Maximum lysis (Amax) was determined by incubating serum with erythrocytes in the 
absence of any inhibitor. Background lysis (Ao) was determined by incubating 
erythrocytes in the absence of any serum or inhibitor To check whether the inhibitor 
itself had any effect on lysis, erythrocytes were incubated with inhibitor alone; none of 
the compounds had any direct effect on lysis of the red blood cells. Inhibition was 
expressed as a fraction of the total cell lysis such that IH50 represents the concentration 
of inhibitor required to give 50% inhibition of lysis. 

IH = A - Ao 
Amax - Ao 

where 0 is equivalent to complete inhibition and 1 equals no inhibition. 

The CM7 protein product of Example 1 inhibited complement-mediated lysis of 
sensitised sheep red blood cells with an IH50 of approx. 6 nM. 

Inhibition of zymosan A-induced activation of the alternative pathway by CM7. 

The alternative pathway of complement was activated with zymosan A, a 
complex carbohydrate from yeast (Sigma, catalogue number Z-4250). Zymosan A was 
made 50 mg/ml in Hepes buffer (0.1M Hepes/0.15M NaCl pH 7.4) and vortexed until a 
fine suspension had formed. Human serum was preincubated with different 
concentrations of complement inhibitor diluted in Hepes buffer for 15 mins at 37°C using 
the volumes given below. Zymosan A was then vortexed for a few seconds each time 
before addition to the samples after which samples were incubated for a further 30 mins 
at 37°C. The zymosan A was then spun down at approximately 1 1 ,000g for 30 seconds 
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at ambient temperature. 100 (il of supernatant were added to an equal volume of 
precipitating solution provided in the kit and assayed as described in the technical 
bulletin of Amersham with the C3a des Arg assay RIA kit purchased from Amersham 
International pic, U. K., (human complement C3a des Arg [ 125 I]assay, code RPA 518). 
In the C3a RIA Assay, activation of complement pathways can be followed by measuring 
the release of the anaphylatoxin, C3a and its breakdown product C3a des Arg. Both 
products can be measured using a competitive radio-immuno assay. Each sample was 
assayed in duplicate and a useful dilution was 1/100. 

Volumes of samples added 

serum inhibitor Zymosan A 

Normal Assay 89 [il 20 \i\ 21 ul 

The data were computed essentially as described in the Amersham technical 
bulletin with the exception that the standard curve was not used and data were calculated 
only as B/Bo. 

Controls included maximum activation (A) i.e. serum + zymosan A only, background 
activation (B) i.e. serum + buffer only and background activation in the presence of 
inhibitor (C) i.e. serum + inhibitor only. D is the value of activation of serum in the 
presence of inhibitor and zymosan A, . These values could then be used to determine the 
% inhibition at each inhibitor concentration, using the following formula I. Note that the 
formula looks unusual because of the nature of the assay, in particular that because the 
assay is a competition assay all factors are inversed eg. maximum activation actually 
gives the lowest counts in the assay. 

Formula I : D - A X 100 

C- A 

The IC50 is defined as the concentration of inhibitor required to reduce maximum 
activation by 50%. Using the data generated experimentally and reproduced in the table 
below, the IC50 for CM 7 of Example 1 was about luM and was indistinguishable from 
that of SCR 1-3 prepared as described in WO94/00571. 
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|aM SCR 1-3 CM7 
21.5 106 
15.4 87 
5-4 59 
3.8 65 
1.35 54 
0.96 43 
0.34 i 5 
0.24 17 
0.084 7 

0.06 6 
0.021 -2 
0.015 -3 
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SEQUENCE LISTING _ 
• SEQ ID NO: 1 

CGACCAT CgCCAACGGTGATTTC AcCTCTAtCAgTCGCGAGt ATTTTCAC 
SEQ ID NO: 2 

GTGAAAATaCTCGCGAcTGaTAGAGgTGAAATCACCGTTGGcGATGGTCG 
SEQ ID NO: 3: 

GACCTACCaCTGCAATCtGGGTAGCcGTGGTaaaAAGGTGTTTGAGC 
SEQ ID NO: 4 : 

GCTCAAACACCTTtttACCACgGCTACCCaGATTGCAGtGGTAGGTC 
SEQ ID NO: 5: 

GGACTAGcAAaGACGATCAAGTGGG 
SEQ ID NO: 6 : 

CCCACTTGATCGTCt TTgCTAGTGC 
SEQ ID NO: 7 (CM1 DNA) 



1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCgCCA 


401 


ACGGTGATTT 


CAcCTCTAtC 


AgTCGCGAGt 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


GCTGCAATCC 


GGGTAGCGGT 


GGTCGTAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


TAATGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAA 
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SEQ ID NO: 8 (CHI) 

1 MQCNAPEWLP FARPTNLTDE FEFPIGTYLN YECRPGYSGR PFSIICLKNS 
51 VWTGAKDRCR RKSCRNPPDP VNGMVHVIKG IQFGSQIKYS CTKGYRLIGS 
101 SSATCIISGD TVIWDNETPI CDRIPCGLPP TIANGDFTSI SREYFHYGSV 
151 VTYRCNPGSG GRKVFELVGE PSIYCTSNDD QVGIWSGPAP QCIIPNK 



SEQ ID NO: 9 {CM2 DNA) 



1 


ATGCAGTGCA 


ACGCTCCGGA 


51 


GACTGATGAA 


TTTGAGTTCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


151 


GTCTGGACTG 


GTGCTAAGGA 


201 


GCCAGATCCG 


GTTAACGGCA 


251 


GTTCCCAAAT 


TAAATATTCT 


301 


TCCAGCGCTA 


CATGCATCAT 


351 


AACACCGATT 


TGTGACCGAA 


401 


ACGGTGATTT 


CATCTCTACC 


451 


GTGACCTACC 


aCTGCAATCt 


501 


CGTGGGTGAG 


CCGTCCATCT 


551 


TCTGGAGCGG 


CCCGGCACCG 



ATGGCTGCCG TTCGCGCGCC CGACCAACCT 
CGATCGGTAC CTACCTGAAC TACGAATGCC 
CCGTTTTCTA TCATCTGCCT GAAAAACTCT 
CCGTTGCCGA CGTAAATCTT GTCGTAATCC 
TGGTGCATGT GATCAAAGGC ATCCAGTTCG 
TGTACTAAAG GTTACCGTCT GATTGGTTCC 
CTCTGGTGAT ACTGTCATTT GGGATAATGA 
TTCCGTGTGG TCTGCCGCCG ACCATCACCA 
AATCGCGAGA ATTTTCACTA TGGTTCTGTG 
GGGTAGCcGT GGTaaaAAGG TGTTTGAGCT 
ACTGCACTAG TAATGACGAT CAAGTGGGCA 
CAGTGCATCA TCCCGAA 



SEQ ID NO: 10 (CM2 ) 

1 MQCNAPEWLP FARPTNLTDE FEFPIGTYLN YECRPGYSGR PFSIICLKNS 

51 VWTGAKDRCR RKSCRNPPDP VNGMVHVIKG IQFGSQIKYS CTKGYRLIGS 

101 SSATCIISGD TVIWDNETPI CDRIPCGLPP TITNGDFIST NRENFHYGSV 

151 VTYHCNLGSR GKKVFELVGE PSIYCTSNDD QVGIWSGPAP QCIIPNK 



43 



ID NO: 11 (CM3 DNA) 



1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 








rn f f* tt* f* f* 7\ rp/-»m 

x CjCjI bt ATCjT 


CjAI C AAAC3C3C 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCACCA 


4 01 


ACGGTGATTT 


CATCTCTACC 


AATCGCGAGA 


ATTTTCACTA 


TGGTTCTGTG 


** J X. 




CZCTCZC A ATPf" 
X o^-rvri X \^\^ 


\J\J\J X nuuOO X 






501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


cAAaGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAA 




ID NO 


:12 (CM3) 










1 


MQCNAPEWLP 


FARPTNLTDE 


FEFP IGTYLN 


YECRPGYSGR 


PFSIICLKNS 


51 


VWTGAKDRCR 


RKSCRNPPDP 


VNGMVH V I KG 


IQFGSQIKYS 


CTKGYRLIGS 


lvl 


SSATCIISGD 


TVIWDNETPI 




111 rJbUr lol 


NKbNr HYCjoV 


151 


VTYRCNPGSG 


GRKVFELVGE 


PSIYCTSKDD 


QVGIWSGPAP 


QCIIPNK 


ID NO 


:13 (CMS DNA) 








1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCgCCA 


401 


ACGGTGATTT 


CAcCTCTAtC 


AgTCGCGAGt 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


GCTGCAATCC 


GGGTAGCGGT 


GGTCGTAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


CAAaGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAA 
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ID NO: 


:14 (CMS) 








- 


1 


MQCNAPEWLP 


FARPTNLTDE 


FEFPIGTYLN 


YECRPGYSGR 


PFSIICLKNS 


51 


VWTGAKDRCR 


RKSCRNPPDP 


VNGMVHVIKG 


IQFGSQIKYS 


CTKGYRLIGS 


1 U 1 


SSATCIISGD 


TVIWDNETPI 


LDR1PCGLPP 


riTNGDFXb T 


NRENFHYGoV 


151 


VTYHCNLGSR 


GKKVFELVGE 


PSIYCTSKDD 


QVGIWSGPAP 


QCIIPNK 


ID NO: 


:15 (CM 6 DNA) 








1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


A AC AC CG ATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCACCA 


401 


ACGGTGATTT 


CATCTCTACC 


AATCGCGAGA 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


aCTGCAATCt 


GGGTAGCcGT 


GGTaaaAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


cAAaGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAA 





SEQ ID NO: 16 <CM6) 

1 MQCNAPEWLP FARPTNLTDE FEFPIGTYLN YECRPGYSGR PFSIICLKNS 
51 VWTGAKDRCR RKSCRNPPDP VNGMVHVIKG IQFGSQIKYS CTKGYRLIGS 
101 SSATCIISGD TVIWDNETPI CDRIPCGLPP TIANGDFTSI SREYFHYGSV 
151 VTYHCNLGSR GKKVFELVGE PSIYCTSKDD QVGIWSGPAP QCIIPNK 

SEQ ID NO: 17 {CM7 DNA) 



1 


atgCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCgCCA 


401 


ACGGTGATTT 


CAcCTCTAtC 


AgTCGCGAGt 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


aCTGCAATCt 


GGGTAGCcGT 


GGTaaaAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


CAAaGACGAT 


CAAGTGGGCA 
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551 TCTGGAGCGG CCCGGCACCG CAGTGCATCA TCCCGAACAA A 
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SEQ ID NO: 18 (CM7 ) 

1 MQCNAPEWLP FARPTNLTDE FEFPIGTYLN YECRPGYSGR PFSIICLKNS 
. 51 VWTGAKDRCR RKSCRNPPDP VNGMVHVIKG IQFGSQIKYS CTKGYRLIGS 
101 SSATCHSGD TVIWDNETPI CDRIPCGLPP TIANGDFTSI SREYFHYGSV 
151 VTYHCNLGSR GKKVFELVGE PSIYCTSKDD QVGIWSGPAP QCIIPNK 

SEQ ID NO: 19 (CM8 DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCgCCAACG GTGATTTCAc 
51 CTCTAtCAgT CGCGAGtATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 
101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 

201 GGCAr^ ACT GCACTAGTAA ^GATCAA GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAA 

SEQ ID NO: 20 (CM8) 

1 MRIPCGLPPT IANGDFTSIS REYFHYGSW TYRCNPGSGG RKVFELVGEP 
51 SIYCTSNDDQ VGIWSGPAPQ CIIPNK 

SEQ ID NO: 21 (CM9 DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 
51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCaCT 
101 GCAATCtGGG TAGCcGTGGT aaaAAGGTGT TTGAGCTCGT GGGTGAGCCG 

201 1 T ccTr CTACT GCACTAGTAA T ^GATCAA GTGGGCATCT GGAGCGGCCC 
2 01 GGCACCGCAG TGCATCATCC CGAA 

SEQ ID NO: 22 (CM9) 

1 MRIPCGLPPT ITNGDFISTREN RENFHYGSW TYHCNLGSRG KKVFELVGEP 
51 SIYCTSNDDQ VGIWSGPAPQ CIIPNK 

SEQ ID NO: 23 (CM10 DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 
51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 
101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 
15 TCCATCTACT GCACTAGcAA aGACGATCAA GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAA 
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SEQ ID NO:24 (CM10) ' 

1 MRIPCGLPPT ITNGDFISTREN RENFHYGSW TYRCNPGSGG RKVFELVGEP 
51 SIYCTSKDDQ VGIWSGPAPQ CIIPNK 

SEQ ID NO : 2 5 (CM12 DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCgCCAACG GTGATTTCAc 

51 CTCTAtCAgT CGCGAGtATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 

101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 

151 TCCATCTACT GCACTAGcAA aGACGATCAA GTGGGCATCT GGAGCGGCCC 

201 GGCACCGCAG TGCATCATCC CGAA 

SEQ ID NO: 26 (CM12) 

1 MRIPCGLPPT ITNGDFISTREN RENFHYGSW TYHCNLGSRG KKVFELVGEP 
51 SIYCTSKDDQ VGIWSGPAPQ CIIPNK 

SEQ ID NO: 27 (CM13 DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 

51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCaCT 

101 GCAATCtGGG TAGCcGTGGT aaaAAGGTGT TTGAGCTCGT GGGTGAGCCG 

151 TCCATCTACT GCACTAGcAA aGACGATCAA GTGGGCATCT GGAGCGGCCC 

2 01 GGCACCGCAG TGCATCATCC CGAA 

SEQ ID NO: 28 (CM13) 

1 MRIPCGLPPT I ANGDFTS I S REYFHYGSW TYRCNPGSGG RKVFELVGEP 
51 SIYCTSKDDQ VGIWSGPAPQ CIIPNK 

SEQ ID NO: 29 (CM14- DNA) 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCgCCAACG GTGATTTCAc 

51 CTCTAtCAgT CGCGAGtATT TTCACTATGG TTCTGTGGTG ACCTACCaCT 

101 GCAATCtGGG TAGCcGTGGT aaaAAGGTGT TTGAGCTCGT GGGTGAGCCG 

151 TCCATCTACT GCACTAGcAA aGACGATCAA GTGGGCATCT GGAGCGGCCC 

201 GGCACCGCAG TGCATCATCC CGAA 

SEQ ID NO:30 (CM14) 

1 MRIPCGLPPT I ANGDFTS I S RE YFK YGS W TYHCNLGSRG KKVFELVGEP 
51 SIYCTSKDDQ VGIWSGPAPQ CIIPNK 
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SEQ ID NO: 31: 

Met Gin Cys Asn Ala Pro Glu Trp Leu Pro Phe Ala Arg Pro Thr Asn 

15 



10 



Leu Thr Asp Glu Phe Glu Phe Pro He Gly Thr Tyr Leu Asn Tyr Glu 



20 



25 



30 



Cys Arg Pro Gly Tyr Ser Gly Arg Pro Phe Ser He He Cys Leu Lys 



35 



40 



45 



Asn Ser Val Trp Thr Gly Ala Lys Asp Arg Cys Arg Arg Lys Ser Cys 
50 



Arg Asn Pro Pro Asp Pro Val Asn Gly Met Val His Val I 



65 



70 



75 



le Lys Gly 
80 



He Gin Phe Gly Ser Gin 
85 



He Lys Tyr Ser Cys Thr Lys Gly 



90 



Tyr Arg 
95 



Leu He Gly Ser Ser S 
100 



er Ala Thr Cys He He Ser Gly Asp 



Thr Val 



105 



110 



He Trp Asp Asn Glu Thr Pro He Cys Asd 



115 



Cys Asp Arg He Pro Cys Gly 



Leu 



120 



125 



Pro Pro Thr He Ala Asn Gl 



130 



y Asp Phe Thr Ser He Ser Arg Glu 



Tyr 



135 



140 



Phe His Tyr Gly Ser Val Val 



145 



Thr Tyr His Cys Asn Leu Gly Ser Arg 



150 



155 



160 



Gly Lys Lys Val Phe Glu Leu Val Gly Glu Pro Ser He Tyr Cys Thi 



165 



170 



175 



Ser Lys Asp Asp Gin Val Gly He Trp Ser Gly Pro Ala Pro Gin Cy £ 
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180 



185 



190 



lie lie Pro Asn Lys Cys 
195 



SEQ ID NO: 32; 



Met Gin Cys Asn Ala Pro Glu Trp Leu Pro Phe Ala Arg Pro Thr Asn 
15 10 15 

Leu Thr Asp Glu Phe Glu Phe Pro lie Gly Thr Tyr Leu Asn Tyr Glu 
20 25 30 

Cys Arg Pro Gly Tyr Ser Gly Arg Pro Phe Ser lie lie Cys Leu Lys 
35 40 45 

Asn Ser Val Trp Thr Gly Ala Lys Asp Arg Cys Arg Arg Lys Ser Cys 
50 55 60 

Arg Asn Pro Pro Asp Pro Val Asn Gly Met Val His Val He Lys Gly 
65 70 75 80 

He Gin Phe Gly Ser Gin He Lys Tyr Ser Cys Thr Lys Gly Tyr Arg 
85 90 95 

Leu lie Gly Ser Ser Ser Ala Thr Cys lie lie Ser Gly Asp Thr Val 
100 105 110 

He Trp Asp Asn Glu Thr Pro He Cys Asp Arg He Pro Cys Gly Leu 
115 120 125 

Pro Pro Thr He Ala Asn Gly Asp Phe Thr Ser lie Ser Arg Glu Tyr 
130 135 140 



Phe His Tyr Gly Ser Val Val Thr Tyr His Cys Asn Leu Gly Ser Arg 
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"5 iso 



155 160 



Gly Lys Lys Val Phe Glu Leu Val Gly Glu Pro Ser He Tyr Cys Thr 
165 170 i7 5 

Ser Lys Asp Asp Gin Val Gly He Trp Ser Gly Pro Ala Pro Gin Cys 

180 185 190 

He He Pro Asn Lys Cys-S-S-Cys Asp Gly Pro Lys Lys Lys Lys Lys 
195 I I 200 205 
C0 2 H CONH 2 

Lys Ser Pro Ser Lys Ser Ser Gly (N-Myristoyl) 
210 215 

SEQ ID NO 33: 

Gly-Ser-Ser-Lys-Ser-Pro-Ser-Lys-Lys-Lys-Lys-Lys-Lys-Pro-Gly-Asp-Cys-NH 2 
SEQ ID NO 34 : 

GCACCGCAGTGCATCATCCCGAACAAATGCTAATAAA 
SEQ ID NO 35 : 

AGCTTTTATTAGCATTTGTTCGGGATGATGCACTGCG 
SEQ ID NO: 36: 

Met Gin Cys Asn Ala Pro Glu Trp Leu Pro Phe Ala Arg Pro Thr Asn 
1 5 10 15 

Leu Thr Asp Glu Phe Glu Phe Pro He Gly Thr Tyr Leu Asn Tyr Glu 
20 25 30 

Cys Arg Pro Gly Tyr Ser Gly Arg Pro Phe Ser He He Cys Leu Lys 
35 40 45 

Asn Ser Val Trp Thr Gly Ala Lys Asp Arg Cys Arg Arg Lys Ser Cys 
50 55 60 
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Arg Asn Pro Pro Asp Pro Val Asn Gly Met VaJ. His Val lie Lys Gly 
65 70 75 80 

lie Gin Phe Gly Ser Gin lie Lys Tyr Ser Cys Thr Lys Gly Tyr Arg 
85 90 95 

Leu He Gly Ser Ser Ser Ala Thr Cys He lie Ser Gly Asp Thr Val 
100 105 110 

lie Trp Asp Asn Glu Thr Pro lie Cys Asp Arg lie Pro Cys Gly Leu 
115 120 125 

Pro Pro Thr lie Thr Asn Gly Asp Phe He Ser Thr Asn Arg Glu Asn 
130 135 140 

Phe His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro Gly Ser Gly 
145 150 155 160 

Gly Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser lie Tyr Cys Thr 
165 170 175 

Ser Asn Asp Asp Gin Val Gly lie Trp Ser Gly Pro Ala Pro Gin Cys 
180 185 190 

lie lie Pro Asn Lys Cys 
195 

SEQ ID NO:37 (CM15) : 

1 MQCNVPEWLP FARPTNLTDD FEFPIGTYLN YECRPGYSGR PFSIICLKNS 

51 VWTSAKD KCK RKSCRNPPDP VNGMAH V I KD IQFRSQIKYS CPKGYRLIGS 

101 SSATCIISGN TVIWDNKTPV CDRIICGLPP TIANGDFTSI SREYFHYGSV 

151 VTYHCNLGSR GKKVFELVGE PSIYCTSKDD QVGIWSGPAP QCIIPNK 
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SEQ ID NO:38 (CM15/cys): 

1 MQCNVPEWLP FARPTNLTDD FEFPIGTYLN YECRPGYSGR PFSIICLKNS 

51 VWTSAKDKCK RKSCRNPPDP VNGMAHVIKD IQFRSQIKYS CPKGYRLIGS 

101 SSATCIISGN TVIWDNKTPV CDRIICGLPP TIANGDFTSI SREYFHYGSV 

151 VTYHCNLGSR GKKVFELVGE PSIYCTSKDD QVGIWSGPAP QCIIPNKC 

SEQ ID NO: 39 

1 [MQCNVPEWLP FARPTNLTDD FEFPIGTYLN YECRPGYSGR PFSIICLKNS 

51 VWTSAKDKCK RKSCRNPPDP VNGMAHVIKD IQFRSQIKYS CPKGYRLIGS 

101 SSATCIISGN TVIWDNKTPV CDRIICGLPP TIANGDFTSI SREYFHYGSV 

151 VTYHCNLGSR GKKVFELVGE PSIYCTSKDD QVGIWSGPAP QCIIPNK] -Cys-S-S-Cys- 

I I 
C0 2 H CONH 2 

-Asp Gly Pro Lys Lys Lys Lys Lys Lys Ser Pro Ser Lys Ser Ser Gly (N- 
Myristoyl) 
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Claims 

1. A soluble polypeptide comprising, in sequence, one to four short consensus 
repeats (SCR) selected from SCR 1, 2, 3 and 4 of long homologous repeat A (LHR-A) as 
the only structurally and functionally intact SCR domains of CR1 and including at least 
SCR3, in which one or more of the native amino acids are substituted with the following: 
Val 4, Asp 19, Ser 53, Lys 57, Ala 74, Asp 79, Arg 84, Pro 91, Asn 109, Lys 1 16, Val 
119, Ala 132, Thr 137, He 139, Ser 140, Tyr 143, His 153, Leu 156, Arg 159, Lys 161, 
Lys 177, Gly 230, Ser 235, His 236. 

(Numbering is from glutamine as residue 1 of mature CR1. The amino-acid indicated is 
that which replaces the CR1 residue at the position specified.) 

2. A polypeptide according to claim 1 which comprises, in sequence, SCR 1, 2, 3 
and 4 -of LHR-A or SCR. 1, 2 and 3 of LHR-A as the only structurally and functionally 
intact SCR domains of CR1. 

3. A polypeptide according to claim 1 or 2 comprising the native interdomain 
sequences in CR1 optionally substituted with the corresponding predicted aminoacids in 
the CRl-like sequence, namely Lys59 and/or Ilel24. (Numbering is from glutamine as 
residue 1 of mature CRL The amino-acid indicated is that which replaces the CR1 
residue at the position specified.) 

4. A polypeptide according to claim 2 or 3 of formula (I): 

NH 2 -Vl-SCRl-Wl-SCR2-X 1 -SCR3-Yl-OH (I) 
in which SCR1 represents residues 2-58 of mature CR1, SCR2 represents residues 63- 
120 of mature CR1, SCR3 represents residues 125-191 of mature CR1, and containing at 
least one of the substitutions as aforesaid and V*, W*, X* and represent bonds or 
short linking sequences of amino acids, preferably 1 to 5 residues in length and which are 
preferably derived from native interdomain sequences in CR1, optionally substituted as 
defined in claim 3. 

5. A polypeptide according to claim 4 wherein W*, X* and represent residues 
59-62, 121-124 and 192-196, respectively, of mature CR1, optionally substituted as 
defined in claim 3, and V* represents residue 1 of mature CR1 optionally linked via its 
N-terminus to methionine. 

6. A polypeptide according to claim 2 or 3 of formula (II): 

NH 2 "V 2 -SCRl-W 2 -SCR2-X 2 -SCR3-Y 2 -SCR4-Z 2 OH (II) 
in which SCR1, SCR2 and SCR3 are as hereinbefore defined, SCR4 represents residues 
197-252 of mature CR1 and containing at least one of the substitutions as aforesaid, and 
V 2 , W 2 , X 2 , Y 2 and Z 2 represent bonds or short linking sequences of amino acids, 
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preferably 1 to 5 residues in length and which are preferably derived from native 
interdomain sequences in CR1, optionally substituted as defined in claim 3. 

7. A polypeptide according to claim 6 wherein W 2 , X! 2 , Y 2 and Z 2 represent 
residues 59-62, 121-124, 192-196, and residues 253 respectively, of mature CR1, 
optionally substituted as defined in claim 3, and V 2 represents residue 1 of mature CR1 
optionally linked via its N-terminus to methionine. 

8. A polypeptide according to claim 1 or 3 of formula (III): 

NH 2 -X 3 -SCR3-Y 3 -OH (III) 
in which SCR 3 is as hereinbefore defined, containing at least one of the substitutions as 
aforesaid, and in a preferred embodiment, all those of Sequence Group 1, and X 3 and Y 3 
represent bonds or short linking sequences of amino acids, preferably 1 to 5 residues in 
length and which are preferably derived from native interdomain sequences in CR1, 
optionally substituted as defined in claim 3. 

9. A polypeptide according to claim 8 wherein X 3 represents amino acids 122-124 
of mature CR1, optionally substituted as defined in claim 3, optionally linked to 
methionine at its N-terminus and Y 4 represents amino acids 192-196 of mature CRL 

10. A polypeptide according to claim 1 or 3 of formula (IV): 

NH 2 -X 4 -SCR3-Y 4 -SCR4-Z 4 -OH (IV) 
in which SCR3 and SCR4 are as hereinbefore defined containing at least one of the 
substitutions as aforesaid and X 4 , Y 4 and Z 4 represent bonds or short linking sequences 
of amino acids, preferably 1 to 5 residues in length and which are preferably derived 
from native interdomain sequences in CR1, optionally substituted as defined in claim 3. 

11. A polypeptide according to claim 10 wherein X 4 represents amino acids 122-124 
of mature CR1, optionally substituted as defined in claim 3, optionally linked to 
methionine at its N-terminus and Y 4 and Z 4 represent amino acids 192-196 and 253 
respectively of mature. CR1. 

12. A polypeptide according to any preceding claim wherein the SCR3 domain is 
substituted with all ten residues found in the corresponding pseudogene sequence, namely 
(in single letter code): 

A132, T137, 1139, S140, Y143, H153, L156, R159, K161, K177 (Sequence Group 1) 
and the remaining domains have the sequence of mature CRL 

13. A polypeptide according to claim 1 selected from SEQ ID NOs: 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30 and 37. 

14. A soluble derivative of the soluble polypeptide of any preceding claim, said 
derivative comprising two or more heterologous membrane binding elements with low 
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membrane affinity covalently associated with the polypeptide which elements are capable 
of interacting independently and with thermodynamic additivity with components of 
cellular membranes exposed to extracellular fluids. 

15. A derivative according to claim 16 compring two to eight membrane binding 
elements selected from: fatty acid derivatives; ligands of known integral membrane 
proteins; sequences derived from the complementarity-determining region of monoclonal 
antibodies raised against epitopes of membrane proteins; membrane binding sequences 
identified through screening of random chemical libraries. 

16. A derivative according to claim 14 or 15 having the following structure: 
[PML-[W]} n -X 

in which; 

P is the soluble polypeptide, 

each L is independently a flexible linker group, 

each W is independently a peptidic membrane binding element, 

n is an integer of 1 or more and 

X is a peptidic or non-peptidic membrane-binding entity which may be covalently 
linked to any W. 

17. A polypeptide derivative which is SEQ ID NO:32 or 39. 

18. The polypeptide portion of a derivative according to any of claims 14 to 17. 

19. A polypeptide portion according to claim 18 which is SEQ ID NO:31 or 38. 

20. A process for preparing a polypeptide according to any of claims 1 to 13 which 
process comprises expressing DNA encoding said polypeptide in a recombinant host cell 
and recovering the product. 

21. A DNA polymer comprising a nucleotide sequence that encodes the polypeptide 
of any of claims 1 to 13, 18 or 19. 

22. A DNA polymer according to claim 21 selected from SEQ ID NOs: 7, 9, 1 1, 13, 
15, 17,19,21,23,25,27 and 29. 

23. A replicable expression vector capable, in the host cell, of expressing the DNA 
polymer of claim 21 or 22. 

24. A host cell transformed with a replicable expression vector of claim 23. 

25. A process for preparing a derivative according to any of claims 14 to 17 which 
process comprises expressing DNA encoding the polypeptide portion of said derivative in 
a recombinant host cell and recovering the product and thereafter post translationally 
modifying the polypeptide to chemically introduce membrane binding elements. 
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26. A pharmaceutical composition comprising a therapeutically effective amount of a 
polypeptide or derivative of any of claims 1 tol7, and a pharmaceutically acceptable 
carrier or excipient. 

27. A method of treating a disease or disorder associated with inflammation or 
inappropriate complement activation comprising administering to a subject in need of 
such treatment a therapeutically effective amount of a polypeptides derivative of any of 
claims 1 to 17. 

28. The use of a polypeptide or derivative of any of claims 1 to 17 in the manufacture 
of a medicament for the treatment of a disease or disorder associated with inflammation 
or inappropriate complement activation. 
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